SEQUENCE LISTING 




<110> Yu, De-Chao 
Li, Yuanhao
Henderson, Daniel R. 

<120> CELL-SPECIFIC ADENOVIRUS VECTORS
COMPRISING AN INTERNAL RIBOSOME ENTRY SITE



<130> 348022001700 

<140> 09/814,351 
<141> 2001-03-21 

<150> 60/192,156 
<151> 2000-03-24 

<160> 35 

<170> FastSEQ for Windows Version 4.0

<210> 1 
<211> 519 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> IRES from encephalomyocarditis virus (EMCV)
<400> 1 

gacgtcgact aattccggtt attttccacc atattgccgt cttttggcaa tgtgagggcc 60 
cggaaacctg gccctgtctt cttgacgagc attcctaggg gtctttcccc tctcgccaaa 120 
ggaatgcaag gtctgttgaa tgtcgtgaag gaagcagttc ctctggaagc ttcttgaaga 180
caaacaacgt ctgtagcgac cctttgcagg cagcggaacc ccccacctgg cgacaggtgc 240 
ctctgcggcc aaaagccacg tgtataagat acacctgcaa aggcggcaca accccagtgc 300 
cacgttgtga gttggatagt tgtggaaaga gtcaaatggc tctcctcaag cgtattcaac 360 
aaggggctga aggatgccca gaaggtaccc cattgtatgg gatctgatct ggggcctcgg 420 
tgcacatgct ttacatgtgt ttagtcgagg ttaaaaaacg tctaggcccc ccgaaccacg 480 
gggacgtggt tttcctttga aaaacacgat gtcgacgtc 519 

<210> 2 
<211> 188 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> IRES from vascular endothelial growth factor (VEGF)



<400> 2 



acgtagtcga cagcgcagag gcttggggca gccgagcggc agccaggccc cggcccgggc 60
ctcggttcca gaagggagag gagcccgcca aggcgcgcaa gagagcgggc tgcctcgcag 120
tccgagccgg agagggagcg cgagccgcgc cggccccgga cggcctccga aaccatggtc 180
gacacgta 188



<210> 3 
<211> 341 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 5' UTR region of HCV 



<400> 3 

gccagccccc tgatgggggc gacactccgc catgaatcac tcccctgtga ggaactactg 60 

tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 

cccccctccc gggagagcca tagtggtGtg cggaaccggt gagtacaccg gaattgccag 180 

gacgaccggg tcctttcttg gattaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 

gcaagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 

gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac c 341 



<210> 4 
<211> 595 
<212> DNA 

<213> Artificial Sequence
<220> 

<223> 5' UTR region of BiP



<400> 4 

cccggggtca ctcctgctgg acctactccg accccctagg ccgggagtga aggcgggact 60 

tigtgcggta ccagcggaaa tgcctcgggg tcagaagtcg caggagagat agacagctgc 120

tgaaccaatg ggaccagcgg atggggcgga tgttatctac cattggtgaa cgttagaaac 180 

gaatagcagc caatgaatca gctggggggg cggagcagtg acgtttattg cggagggggc 240 

cgcttcgaat cggcggcggc cagcttggtg gcctgggcca atgaacggcc tccaacgagc 300 

agggccttca ccaatcggcg gcctccacga cggggctggg ggagggtata taagccgagt 360 

aggcgacggt gaggtcgacg ccggccaaga cagcacagac agattgacct attggggtgt 420 

ttcgcgagtg tgagagggaa gcgccgcggc ctgtatttct agacctgccc ttcgcctggt 480 

tcgtggcgcc ttgtgacccc gggcccctgc cgcctgcaag tcgaaattgc gctgtgctcc 540 

tgtgctacgg cctgtggctg gactgcctgc tgctgcccaa ctggctggca agatg 595 



<210> 5 
<211> 575 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> 5' UTR of PDGF



<400> 5 

gtttgcacct ctccctgccc gggtgctcga gctgccgttg caaagccaac tttggaaaaa 60 

gttttttggg ggagacttgg gccttgaggt gcccagctcc gcgctttccg attttggggg 120 

ctttccagaa aatgttgcaa aaaagctaag ccggcgggca gaggaaaacg cctgtagccg 180 

gcgagtgaag acgaaccatc gactgccgtg ttccttttcc tcttggaggt tggagtcccc 240 
tgggcgcccc cacaccccta gacgcctcgg ctggttcgcg acgcagcccc ccggccgtgg 300

atgctgcact cgggctcggg atccgcccag gtagccggcc tcggacccag gtcctgcgcc 360 

caggtcctcc cctgcccccc agcgacggag ccggggccgg gggcggcggc gccgggggca 420 

tgcgggtgag ccgcggctgc agaggcctga gcgcctgatc gccgcggacc tgagccgagc 480

ccacccccct ccccagcccc ccaccctggc cgcgggggcg gcgcgctcga tctacgcgtc 540

cggggccccg cggggccggg cccggagtcg gcatg 575 



<210> 6 
<211> 2240
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Human uroplakin II 5' flanking region 
<400> 6 

tcgataggta cccactatag ggcacgcgtg gtcgacggcc cgggctggtc tggcaacttc 60 

aagtgtgggc ctttcagacc ggcatcatca gtgttacggg gaagtcacta ggaatgcaga 120 

attgattgag cacggtggct cacacctgta atcccaacac tctgggaggc caaggcaggt 180 

ggatcacttg aggtcaggag tttgagacca gcctggccaa catggtgaaa cctcatctct 240

actaaaaata caaaaattag ctgggaatgg tggcacatgc ctataatccc agttactcag 300 

gaggctgagg caggagaatc atttgaacct gggaggcaga ggttgcagtg agccgagatc 360 

acgccactgc actccagcct gggtgacaca gcgagactct gtctcaaaaa aaaaaaaatg 420 

cagaatttca ggcttcaccc cagacccact gcatgactgc atgagaagct gcatcttaac 480 

aagatccctg gtaattcata cgcatattaa atttggagat gcactggcgt aagaccctcc 540 

tactctctgc ttaggcccat gagttcttcc tttactgtca ttctccactc accccaaact 600 

ttgagcctac ccttcccacc ttggcggtaa ggacacaacc tccctcacat tcctaccagg 660 

accctaagct tccctgggac tgaggaagat agaatagttc gtggagcaaa cagatataca 720 

gcaacagtct ctgtacagct ctcaggcttc tggaagttct acagcctctc ccgacaaagt 780 

attccacttt ccacaagtaa ctctatgtgt ctgagtctca gtttccactt ttctctctct 840

ctctctctct caactttctg agacagagtt tcacttagtc gcccaggctg gagtgcaggg 900 

gcacaatctc ggctcactgc aacctccacc tcctgggttc aagtgtttct cctgtctcag 960 

cctcccgagt agctgggatt acaggcacac accaccgcgt tagtttttgt atttttggta 1020 

gagatggtgt ttcgccatat tggccaggct gatctcgaac tcctgacctc aggtgatccg 1080 

cccacctcgg cctcccaaag tgctgggatt acaggcatga gccaccacgc ccggctgatc 1140 

tcttttctat tttaatagag atcaaactct ctgtgttgcc taggctggtc ttgaactcct 1200 

ggcctcgagt gatcctccca ccttggcctc ccaaagtgtt gagattacag gcatgagcca 1260 

ctgtgcctgg cctcagttct actacaaaag gaagccagta ccagctacca cccagggtgg 1320 

ctgtagggct acaatggagc acacagaacc cctacccagg gcccggaaga agccccgact 1380 

cctctcccct ccctctgccc agaactcctc cgcttctttc tgatgtagcc cagggccgga 1440 

ggaggcagtc agggaagttc tgtctctttt tcatgttatc ttacgaggtc tcttttctcc 1500 

attctcagtc caacaaatgg ttgctgccca aggctgactg tgcccacccc caacccctgc 1560 

tggccagggt caatgtctgt ctctctggtc tctccagaag tcttccatgg ccaccttcgt 1620 

ccccaccctc cagaggaatc tgaaaccgca tgtgctccct ggcccccaca gcccctgcct 1680 

ctcccagagc agcagtacct aagcctcagt gcactccaag aattgaaacc ctcagtctgc 1740 

tgcccctccc caccagaatg tttctctccc attcttaccc actcaaggcc ctttcagtag 1800 

ccccttggag tattctcttc ctacatatca gggcaacttc caaactcatc acccttctga 1860 

ggggtggggg aaagaccccc accacatcgg gggagcagtc ctccaaggac tggccagtct 1920 

ccagatgccc gtgcacacag gaacactgcc ttatgcacgg gagtcccaga agaaggggtg 1980 

atttctttcc ccaccttagt tacaccatca agacccagcc agggcatccc ccctcctggc 2040 

ctgagggcca gctccccatc ctgaaaaacc tgtctgctct ccccacccct ttgaggctat 2100 

agggcccaag gggcaggttg gactggattc ccctccagcc cctcccgccc ccaggacaaa 2160 

atcagccacc ccaggggcag ggcctcactt gcctcaggaa ccccagcctg ccagcaccta 2220 

ttccacctcc cagcccagca 2240 

<210> 7 
<211> 3592 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Mouse uroplakin II 5' flanking region 
<400> 7 

ctcgaggatc tcggccctct ttctgcatcc ttgtcctaaa tcattttcat atcttgctag 60 

acctcagttt gagagaaacg aaccttctca ttttcaagtt gaaaaaaaaa agaggttcaa 120 

^gtggctcac tcaaagttac aagccaacac tcaccactac gagtacaatg gccaccatta 180 

gtgctggcat gccccaggag acaggcatgc atattattct agatgactgg gaggcagagg 240 

ggtggcctag tgaggtcaga ctgtggacag atcaggcaga tgtgggttct gatcccaatt 300 

cctcaggccg cagaactact gtggttcaag aaggggacaa aaggactgca gtccggaaca 360 

ggaggtccat ttgagagctg actgagcaga agaggaaagt gaagaacttc tggggcaaga 420 

gcttacccta ctttacagct ttgttgtctt ctttactcca ggggcgtccc tggtactcag 480 

taaatgtctg ttggcttgag gaacatatgt gtaaggagga aggagaggga acttgaggga 540 

gttaagactc aagaatcaat caaggagagg acagcagaga agacagggtt tgggagagag 600 

actccagaca ttggccctgg ttcccttctt ggccactgtg aaaccctcca gaggaactga 660 

gtgctgtggc tttaaatgat ctcagcactg tcagtgaagc gctctgctca aagagttatc 720

ctcttgctcc tgtgccgggg cctccccctc ctctcagctc ccaaaGcctt ctcagccact 780 

gtgatggcat aattagatgc gagagctcag accgtcaggt ctgctccagg aaccacccat 840 

tttccccaac cccagagaaa ggtcctagtg gaaaagtggg ggccactgaa gggctgatgg 900 

ggttctgtcc tttcccccat gctgggtgga cttaaagtct gcgatgtgtg tagggggtag 960 

aagacaacag aacctggggg ctccggctgg gagcaggagg aactctcacc agacgatctc 1020 

caaatttact gtgcaatgga cgatcaggaa actggttcag atgtagcttc tgatacagtg 1080 

ggtctgaggt aaaacccgaa acttaatttc tttcaaaaat ttaaagttgc atttattatt 1140 

ttatatgtgt gcccatatgt gtgccacagt gtctatgtgg aggtcagagg gcaagttgtg 1200 

ggcattggct ctctcctttc ataatgtggc ttctggggac caaaatgtca ggcatggtgg 1260 

caagagcttt tacctgttga gccatctcat ggtttcgtaa aacttcctat gacgcttaca 1320 
ggtaacgcag agacacagac tcacatttgg agttagcaga tgctgtattg gtgtaaacac 1380

tcatacacag acacacacac atactcatac acacacacac acacttatca catgcacaca 1440 

catactcgta tacacacaga cacacacaca tgcactctca cattcacata ttcatacaca 1500

tccacacaca cactcatcca cacacacaga cacacatact catccacaca cacacacaca 1560

catactcata cacacacaca gacacacata ctcatacaca cacacagaca cacacatata 1620

atcatacata cacagacaca ctcatacatg tgcacacaca cactcatcca cacacacaca 1680 

ctcatacaca cacacactca tacacacaca cactcataca cacacacacg aggtttttct 1740

caggctgcct ttgggtggag actggaactg atttctgttt ttcagctcct tggctttttg 1800 

tccctttaga tgagatctcc tcctcacttt acacacagaa agatcacaca cgagggagaa 1860 

ctggcggtgc ggaagagggc tacacggtag ggtgtcaggg tcaggagatc ttcctggcaa 1920 

gtctcaaacc tccacatagc acagtgttta cgtgaggatt taggaggaat caggaagagg 1980 

attggtttac tgcagagcag accatatagg tccactccta agccccattt gaaattagaa 2040

gtgagacagt gtgggataaa aagagcagat ctctggtcac atttttaaag ggatatgagg 2100 

gtcctgtgcc tttaagcctt cccatctccc tccaatcccc cctcaccttc cccaccctaa 2160

ccctccccag gtttctggag gagcagagtt gcgtcttctc cctgccctgc cgagctgctc 2220 

actggctgct ctagaggctg tgctttgcgg tctccatgga aaccattagt tgctaagcaa 2280 
ctggagcatc atctgtgctg agctcaggtc ctatcgagtt cacctagctg agacacccac 2340

gcccctgcag ccactttgca gtgacaagcc tgagtctcag gttctgcatc tataaaaacg 2400 

agtagccttt caggagggca tgcagagccc cctggccagc gtctagagga gaggtgactg 2460 

agtggggcca tgtcactcgt ccatggctgg agaacctcca tcagtctccc agttagcctg 2520 

gggcaggaga gaaccagagg agctgtggct gctgattgga tgatttacgt acccaatctg 2580 

ttgtcccagg catcgaaccc cagagcgacc tgcacacatg ccaccgctgc cccgccctcc 2640 

acctcctctg ctcctggtta caggattgtt ttgtcttgaa gggttttgtt gttgctactt 2700

tttgctttgt tttttctttt ttaacataag gtttctctgt gtagccctag ctgtcctgga 2760

actcactctg tagaccaggc tggcctcaaa ctcagaaatc caccttcctc ccaagtgctg 2820

ggattaaagg cattcgcacc atcgcccagc ccccggtctt gtttcctaag gttttcctgc 2880 

tttactcgct acccgttgca caaccgcttg ctgtccaagt ctgtttgtat ctactccacc 2940

gcccactagc cttgctggac tggacctacg tttacctgga agccttcact aacttccctt 3000

gtctccacct tctggagaaa tctgaaggct cacactgata ccctccgctt ctcccagagt 3060 

cgcagtttct taggcctcag ttaaatacca gaattggatc tcaggctctg ctatccccac 3120

cctacctaac caaccccctc ctctcccatc cttactagcc aaagcccttt caacccttgg 3180 

ggcttttcct acacctacac accagggcaa ttttagaact catggctctc ctagaaaacg 3240 

cctacctcct tggagactga ccctctacag tccaggaggc agacactcag acagaggaac 3300 

tctgtccttc agtcgcggga gttccagaaa gagccatact cccctgcaga gctaactaag 3360

ctgccaggac ccagccagag catccccctt tagccgaggg ccagctcccc agaatgaaaa 3420

acctgtctgg ggcccctccc tgaggctaca gtcgccaagg ggcaagttgg actggattcc 3480 
cagcagcccc tcccactccg agacaaaatc agctaccctg gggcaggcct cattggcccc 3540
aggaaacccc agcctgtcag cacctgttcc aggatccagt cccagcgcag ta 3592

<210> 8
<211> 822 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> AFP-TRE



<400> 8 

gcattgctgt gaactctgta cttaggacta aactttgagc aataacacac atagattgag 60 

gattgtttgc tgttagcata caaactctgg ttcaaagctc ctctttattg cttgtcttgg 120

aaaatttgct gttcttcatg gtttctcttt tcactgctat ctatttttct caaccactca 180 

catggctaca ataactgtct gcaagcttat gattcccaaa tatctatctc tagcctcaat 240

cttgttccag aagataaaaa gtagtattca aatgcacatc aacgtctcca cttggagggc 300 

ttaaagacgt ttcaacatac aaaccgggga gttttgcctg gaatgtttcc taaaatgtgt 360 

cctgtagcac atagggtcct cttgttcctt aaaatctaat tacttttagc ccagtgctca 420 

tcccacctat ggggagatga gagtgaaaag ggagcctgat taataattac actaagtcaa 480 

taggcataga gccaggactg tttgggtaaa ctggtcactt tatcttaaac taaatatatc 540 

caaaactgaa catgtactta gttactaagt ctttgacttt atctcattca taccactcag 600 

ctttatccag gccacttatg agctctgtgt ccttgaacat aaaatacaaa taaccgctat 660 

gctgttaatt attggcaaat gtcccatttt caacctaagg aaataccata aagtaacaga 720

tataccaaca aaaggttact agttaacagg cattgcctga aaagagtata aaagaatttc 780 

agcatgattt tccatattgt gcttccacca ctgccaataa ca 822 



<210> 9 
<211> 451 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Probasin-TRE 



<400> 9 



aag ctt cca caa gtg cat tta gcc tct cca gta ttg ctg atg aat cca 48
cag ttc agg ttc aat ggc gtt caa aac ttg atc aaa aat gac cag act 96
tta tat tta cac caa cat cta tct gat tgg agg aat gga taa tag tca 144
tca tgt tta aac atc tac cat tcc agt taa gaa aat atg ata gca tct 192
tgt tct tag tct ttt tct taa tag gga cat aaa gcc cac aaa taa aaa 240
tat gcc tga aga atg gga cag gca ttg ggc att gtc cat gcc tag taa 288
agt act cca aga acc tat ttg tat act aga tga cac aat gtc aat gtc 336
tgt gta caa ctg cca act ggg atg caa gac act gcc cat gcc aat cat 384
cct gaa aag cag cta taa aaa gca gga agc tac tct gca cct tgt cag 432
tag gtc cag ata cct aca g 451



<210> 10 
<211> 546 
<212> DNA 

<213> Artificial Sequence



<220> 

<223> Tyrosinase-TRE
<400> 10 

ccggttgaaa atgataagtt gaattctgtc ttcgagaaca tagaaaagaa ttatgaaatg 60 
ccaacatgtg gttacaagta atgcagaccc aaggctcccc agggacaaga agtcttgtgt 120

taactctttg tggctctgaa agaaagagag agagaaaaga ttaagcctcc ttgtggagat 180 

catgtgatga cttcctgatt ccagccagag cgagcatttc catggaaact tctcttcctc 240

ttcactcgag attactaacc ttattgttaa tattctaacc ataagaatta aactattaat 300 

ggtgaataga gtttttcact ttaacatagg cctatcccac tggtgggata cgagccaatt 360 

cgaaagaaaa agtcagtcat gtgcttttca gaggatgaaa gcttaagata aagactaaaa 420 

gtgtttgatg ctggaggtgg gagtggtatt atataggtct cagccaagac atgtgataat 480 

cactgtagta gtagctggaa agagaaatct gtgactccaa ttagccagtt cctgcagacc 540 

ttgtga 546 



<210> 11 
<211> 12047 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Human glandular kallikrein-TRE 



<400> 11 

gaattcagaa ataggggaag gttgaggaag gacactgaac tcaaagggga tacagtgatt 60 

ggtttatttg tcttctcttc acaacattgg tgctggagga attcccaccc tgaggttatg 120 

aagatgtctg aacacccaac acatagcact ggagatatga gctcgacaag agtttctcag 180 

ccacagagat tcacagccta gggcaggagg acactgtacg ccaggcagaa tgacatggga 240 

attgcgctca cgattggctt gaagaagcaa ggactgtggg aggtgggctt tgtagtaaca 300 

agagggcagg gtgaactctg attcccatgg gggaatgtga tggtcctgtt acaaattttt 360 

caagctggca gggaataaaa cccattacgg tgaggacctg tggagggcgg ctgccccaac 420 

tgataaagga aatagccagg tgggggcctt tcccattgta ggggggacat atctggcaat 480 

agaagccttt gagacccttt agggtacaag tactgaggca gcaaataaaa tgaaatctta 540 

tttttcaact ttatactgca tgggtgtgaa gatatatttg tttctgtaca gggggtgagg 600 

gaaaggaggg gaggaggaaa gttcctgcag gtctggtttg gtcttgtgat ccagggggtc 660 

ttggaactat ttaaattaaa ttaaattaaa acaagcgact gttttaaatt aaattaaatt 720 

aaattaaatt ttactttatt ttatcttaag ttctgggcta catgtgcagg acgtgcagct 780 

ttgttacata ggtaaacgtg tgccatggtg gtttgctgta cctatcaacc catcacctag 840 

gtattaagcc cagcatgcat tagctgtttt tcctgacgct ctccctctcc ctgactccca 900 

caacaggccc cagtgtgtgt tgttcccctc cctgtgtcca tgtgttctca ttgttcagct 960 

cccacttata agtgagaaca tgtggtgttt ggttttctgt ttctgtgtta gtttgctgag 1020 

gataatggct tccacctcca tccatgttcc tgcaaaggac gtgatcttat tcttttttat 1080 

ggttgcatag aaattgtttt tacaaatcca attgatattg tatttaatta caagttaatc 1140 

taattagcat actagaagag attacagaag atattaggta cattgaatga ggaaatatat 1200

aaaataggac gaaggtgaaa tattaggtag gaaaagtata atagttgaaa gaagtaaaaa 1260 

aaaatatgca tgagtagcag aatgtaaaag aggtgaagaa cgtaatagtg actttttaga 1320

ccagattgaa ggacagagac agaaaaattt taaggaattg ctaaaccatg tgagtgttag 1380 

aagtacagtc aataacatta aagcctcagg aggagaaaag aataggaaag gaggaaatat 1440 

gtgaataaat agtagagaca tgtttgatgg attttaaaat atttgaaaga cctcacatca 1500 

aaggattcat accgtgccat tgaagaggaa gatggaaaag ccaagaagcc agatgaaagt 1560 

tagaaatatt attggcaaag cttaaatgtt aaaagtccta gagagaaagg atggcagaaa 1620 

tattggcggg aaagaatgca gaacctagaa tataaattca tcccaacagt ttggtagtgt 1680 

gcagctgtag ccttttctag ataatacact attgtcatac atcgcttaag cgagtgtaaa 1740 

atggtctcct cactttattt atttatatat ttatttagtt ttgagatgga gcctcgctct 1800 
gtctcctagg ctggagtgca atagtgcgat accactcact gcaacctctg cctcctctgt 1860

tcaagtgatt ttcttacctc agcctcccga gtagctggga ttacaggtgc gtgccaccac 1920

acccggctaa tttttgtatt ttttgtagag acggggtttt gccatgttgg ccaggctggt 1980 

cttgaactcc tgacatcagg tgatccacct gccttggcct cctaaagtgc tgggattaca 2040 

ggcatgagcc accgtgccca accactttat ttatttttta tttttatttt taaatttcag 2100 

cttctatttg aaatacaggg ggcacatata taggattgtt acatgggtat attgaactca 2160 

ggtagtgatc atactaccca acaggtaggt tttcaaccca ctccccctct tttcctcccc 2220 

attctagtag tgtgcagtgt ctattgttct catgtttatg tctatgtgtg ctccaggttt 2280 
agctcccacc tgtaagtgag aacgtgtggt atttgatttt ctgtccctgt gttaattcac 2340
ttaggattat ggcttccagc tccattcata ttgctgtaaa ggatatgatt catttttcat 2400
ggccatgcag tattccatat tgcgtataga tcacattttc tttctttttt ttttttgaga 2460
cggagtcttg ctttgctgcc taggctggag tgcagtagca cgatctcggc tcactgcaag 2520

cttcacctcc ggggttcacg tcattcttct gtctcagctt cccaagtagc tgggactaca 2580

ggcgcccgcc accacgtccg gctaattttt ttgtgtgttt ttagtagaga tgggggtttc 2640 

actgtgttag ccaggatggt cttgatctcc tgaccttgtg gtccacctgc ctcggtctcc 2700 

caaagtgctg ggattacagg ggtgagccac tgcgcccggc ccatatatac cacattttct 2760 

ttaaccaatc caccattgat gggcaactag gtagattcca tggattccac agttttgcta 2820 

ttgtgtgcag tgtggcagta gacatatgaa tgaatgtgtc tttttggtat aatgatttgc 2880 

attcctttgg gtatacagtc attaatagga gtgctgggtt gaacggtggc tctgtttaaa 2940

attctttgag aattttccaa actgtttgcc atagagagca aactaattta catttccacg 3000 

aacagtatat aagcattccc ttttctccac agctttgtca tcatggtttt tttttttctt 3060

tattttaaaa aagaatatgt tgttgttttc ccagggtaca tgtgcaggat gtgcaggttt 3120 

gttacatagg tagtaaacgt gagccatggt ggtttgctgc acctgtcaac ccattacctg 3180

ggtatgaagc cctgcctgca ttagctcttt tccctaatgc tctcactact gccccaccct 3240 

caccctgaca gggcaaacag acaacctaca gaatgggagg aaatttttgc aatctattca 3300 

tctgacaaag gtcaagaata tccagaatct acaaggaact taagcaaatt tttacttttt 3360

aataatagcc actctgactg gcgtgaaatg gtatctcatt gtggttttca tttgaatttc 3420 

tctgatgatc agtgacgatg agcatttttt catatttgtt ggctgcttgt acgtcttttg 3480 

agaagtgtct cttcatgcct tttggccact ttaatgggat tattttttgc tttttagttt 3540
aagttcctta tagattctgg atattagact tcttattgga tgcatagttt gtgaatactc 3600

tcttccattc tgtaggttgt ctgtttactc tattgatggc ttcttttgct gtgccgaagc 3660 

atcttagttt aattagaaac cacctgccaa tttttgtttt tgttgcaatt gcttttgggg 3720 

acttagtcat aaactctttg ccaaggtctg ggtcaagaag agtatttcct aggttttctt 3780 

ctagaatttt gaaagtctga atgtaaacat ttgcattttt aatgcatctt gagttagttt 3840

ttgtatatgt gaaaggtcta ctctcatttt ctttccctct ttctttcttt ctttcttttc 3900

tttctttctt tctttctttc tttctttctt tctttctttc tttctttttg tccttctttc 3960 

tttctttctt tctctttctt tctctctttc tttttttttt ttgatggagt attgctctgt 4020 

tgcccaggct gcagtgcagc ggcacgatct cggctcactg caacctctgc ctcctgggtt 4080 

caactgattc tcctgcatca gccttccaag tagctgggat tataggcgcc cgccaccacg 4140 

cccgactaat ttttgtattt ttagtagaga cggggttgtg ccatgttggc caggctggtt 4200 

tgaaactcct gacctcaaac gatctgcctg ccttggcctc ccaaagtgct gggattacag 4260 

gtgtgagcca ctgtgcccag ccaagaatgt cattttctaa gaggtccaag aacctcaaga 4320 

tattttggga ccttgagaag agaggaattc atacaggtat tacaagcaca gcctaatggc 4380 

aaatctttgg catggcttgg cttcaagact ttaggctctt aaaagtcgaa tccaaaaatt 4440 

tttataaaag ctccagctaa gctaccttaa aaggggcctg tatggctgat cactcttctt 4500 

gctatacttt acacaaataa acaggccaaa tataatgagg ccaaaattta ttttgcaaat 4560 

aaattggtcc tgctatgatt tactcttggt aagaacaggg aaaatagaga aaaatttaga 4620 

ttgcatctga cctttttttc tgaattttta tatgtgccta caatttgagc taaatcctga 4680 

attattttct ggttgcaaaa actctctaaa gaagaacttg gttttcattg tcttcgtgac 4740 

acatttatct ggctctttac tagaacagct ttcttgtttt tggtgttcta gcttgtgtgc 4800 

cttacagttc tactcttcaa attattgtta tgtgtatctc atagttttcc ttcttttgag 4860 

aaaactgaag ccatggtatt ctgaggacta gagatgactc aacagagctg gtgaatctcc 4920

tcatatgcaa tccactgggc tcgatctgct tcaaattgct gatgcactgc tgctaaagct 4980 

atacatttaa aaccctcact aaaggatcag ggaccatcat ggaagaggag gaaacatgaa 5040 

attgtaagag ccagattcgg ggggtagagt gtggaggtca gagcaactcc accttgaata 5100 

agaaggtaaa gcaacctatc ctgaaagcta acctgccatg gtggcttctg attaacctct 5160 

gttctaggaa gactgacagt ttgggtctgt gtcattgccc aaatctcatg ttaaattgta 5220

atccccagtg ttcggaggtg ggacttggtg gtaggtgatt cggtcatggg agtagatttt 5280 

cttctttgtg gtgttacagt gatagtgagt gagttctcgt gagatctggt catttaaaag 5340 

tgtgtggccc ctcccctccc tctcttggtc ctcctactgc catgtaagat acctgctcct 5400 

gctttgcctt ctaccataag taaaagcccc ctgaggcctc cccagaagca gatgccacca 5460 

tgcttcctgt acagcctgca gaaccatcag ccaattaaac ctcttttctg tataaattac 5520 

cagtcttgag tatctcttta cagcagtgtg agaacggact aatacaaggg tctccaaaat 5580 

tccaagttta tgtattcttt cttgccaaat agcaggtatt taccataaat cctgtcctta 5640 

ggtcaaacaa ccttgatggc atcgtacttc aattgtctta cacattcctt ctgaatgact 5700 
cctcccctat ggcatataag ccctgggtct tgggggataa tggcagaggg gtccaccatc 5760
ttgtctggct gccacctgag acacggacat ggcttctgtt ggtaagtctc tattaaatgt 5820

ttctttctaa gaaactggat ttgtcagctt gtttctttgg cctctcagct tcctcagact 5880 

ttggggtagg ttgcacaacc ctgcccacca cgaaacaaat gtttaatatg ataaatatgg 5940 

atagatataa tccacataaa taaaagctct tggagggccc tcaataattg ttaagagtgt 6000 

aaatgtgtcc aaagatggaa aatgtttgag aactactgtc ccagagattt tcctgagttc 6060 

tagagtgtgg gaatatagaa cctggagctt ggcttcttca gcctagaatc aggagtatgg 6120 

ggctgaagtc tgaagcttgg cttcagcagt ttggggttgg cttccggagc acatatttga 6180 

catgttgcga ctgtgatttg gggtttggta tttgctctga atcctaatgt ctgtccttga 6240 

ggcatctaga atctgaaatc tgtggtcaga attctattat cttgagtagg acatctccag 6300 

tcctggttct gccttctagg gctggagtct gtagtcagtg acccggtctg gcatttcaac 6360 

ttcatataca gtgggctatc ttttggtcca tgtttcaacc aaacaaccga ataaaccatt 6420 

agaacctttc cccacttccc tagctgcaat gttaaaccta ggatttctgt ttaataggtt 6480 

catatgaata atttcagcct gatccaactt tacattcctt ctaccgttat tctacaccca 6540

ccttaaaaat gcattcccaa tatattccct ggattctacc tatatatggt aatcctggct 6600 

ttgccagttt ctagtgcatt aacatacctg atttacattc ttttacttta aagtggaaat 6660 

aagagtccct ctgcagagtt caggagttct caagatggcc cttacttctg acatcaattg 6720

agatttcaag ggagtcgcca agatcatcct caggttcagt gattgctggt agccctcata 6780 

taactcaatg aaagctgtta tgctcatggc tatggtttat tacagcaaaa gaatagagat 6840 

gaaaatctag caagggaaga gttgcatggg gcaaagacaa ggagagctcc aagtgcagag 6900 

attcctgttg ttttctccca gtggtgtcat ggaaagcagt atcttctcca tacaatgatg 6960 

tgtgataata ttcagtgtat tgccaatcag ggaactcaac tgagccttga ttatattgga 7020 

gcttggttgc acagacatgt cgaccacctt catggctgaa ctttagtact tagcccctcc 7080 

agacgtctac agctgaliagg ctgtaaccca acattgtcac cataaatcac attgttagac 7140 

tatccagtgt ggcccaagct cccgtgtaaa cacaggcact ctaaacaggc aggatatttc 7200 

aaaagcttag agatgacctc ccaggagctg aatgcaaaga cctggcctct ttgggcaagg 7260 

agaatccttt accgcacact ctccttcaca gggttattgt gaggatcaaa tgtggtcatg 7320

tgtgtgagac accagcacat gtctggctgt ggagagtgac ttctatgtgt gctaacattg 7380 

ctgagCgcta agaaagtatt aggcatggct ^tcagcactc acagatgctc atctaatcct 7440

cacaacatgg ctacagggtg ggcactacta gcctcatttg acagaggaaa ggactgtgga 7500 

taagaagggg gtgaccaata ggtcagagtc attctggatg caaggggctc cagaggacca 7560 

tgattagaca ttgtctgcag agaaattatg gctggatgtc tctgccccgg aaagggggat 7620

gcactttcct tgacccccta tctcagatct tgactttgag gttatctcag acttcctcta 7680

tgataccagg agcccatcat aatctctctg tgtcctctcc ccttcctcag tcttactgcc 7740 

cactcttccc agctccatct ccagctggcc aggtgtagcc acagtaccta actctttgca 7800 

gagaactata aatgtgtatc ctacagggga gaaaaaaaaa aagaactctg aaagagctiga 7860

cattttaccg acttgcaaac acataagcta acctgccagt tttgtgctgg tagaactcat 7920 

gagactcctg ggtcagaggc aaaagatttt attacccaca gctaaggagg cagcatgaac 7980

tttgtgttca catttgttca ctttgccccc caattcatat gggatgatca gagcagttca 8040

ggtggatgga cacaggggtt tgtggcaaag gtgagcaacc taggcttaga aatcctcaat 8100

cttataagaa ggtactagca aacttgtcca gtctttgtat ctgacggaga tattatcttt 8160 

ataattgggt tgaaagcaga cctactctgg aggaacatat tgtatttatt gtcctgaaca 8220 

gtaaacaaat ctgctgtaaa atagacgtta actttattat ctaaggcagt aagcaaacct 8280

agatctgaag gcgataccat cttgcaaggc tatctgctgt acaaatatgc ttgaaaagat 8340

ggtccagaaa agaaaacggt attattgcct ttgctcagaa gacacacaga aacataagag 8400 

aaccatggaa aattgtctcc caacactgtt cacccagagc cttccactct tgtctgcagg 8460 

acagtcttaa catcccatca ttagtgtgtc taccacatct ggcttcaccg tgcctaacca 8520 

agatttctag gtccagttcc ccaccatgtt tggcagtgcc ccactgccaa ccccagaata 8580 

agggagtgct cagaattccg aggggacatg ggtggggatc agaacttctg ggcttgagtg 8640 

cagagggggc ccatactcct tggttccgaa ggaggaagag gctggaggtg aatgtccttg 8700 

gaggggagga atgtgggttc tgaactctta aatccccaag ggaggagact ggtaaggtcc 8760 

cagcttccga ggtactgacg tgggaatggc ctgagaggtc taagaatccc gtatcctcgg 8820 

gaaggagggg ctgaaattgt gaggggttga gttgcagggg tttgttagct tgagactcct 8880 

tggtgggtcc ctgggaagca aggactggaa ccattggctc cagggtttgg tgtgaaggta 8940

atgggatctc ctgattctca aagggtcaga ggactgagag ttgcccatgc tttgatcttt 9000 

ccatctactc cttactccac ttgagggtaa tcacctactc ttctagttcc acaagagtgc 9060 

gcctgcgcga gtataatctg cacatgtgcc atgtcccgag gcctggggca tcatccactc 9120 
atcattcagc atctgcgcta tgcgggcgag gccggcgcca tgacgtcatg tagctgcgac 9180
tatccctgca gcgcgcctct cccgtcacgt cccaaccatg gagctgtgga cgtgcgtccc 9240

ctggtggatg tggcctgcgt ggtgccaggc cggggcctgg tgtccgataa agatcctaga 9300 

accacaggaa accaggactg aaaggtgcta gagaatggcc atatgtcgct gtccatgaaa 9360 

tctcaaggac ttctgggtgg agggcacagg agcctgaact tacgggtttg ccccagtcca 9420

ctgtcctccc aagtgagtct cccagatacg aggcactgtg ccagcatcag cttcatctgt 9480 

accacatctt gtaacaggga ctacccagga ccctgatgaa caccatggtg tgtgcaggaa 9540 

gagggggtga aggcatggac tcctgtgtgg tcagagccca gagggggcca tgacgggtgg 9600 

ggaggaggct gtggactggc tcgagaagtg ggatgtggtt gtgtttgatt tcctttggcc 9660 

agataaagtg ctggatatag cattgaaaac ggagtatgaa gaccagttag aatggagggt 9720

caggttggag ttgagttaca gatggggtaa aattctgctt cggatgagtt tggggattgg 9780 

caatctaaag gtggtttggg atggcatggc tttgggatgg aaataggttt gtttttatgt 9840 

tggctgggaa gggtgtgggg attgaattgg ggatgaagta ggtttagttt tggagataga 9900 

atacatggag ctggctattg catgcgagga tgtgcattag tttggtttga tctttaaata 9960 

aaggaggcta ttagggttgt cttgaattag attaagttgt gttgggttga tgggttgggc 10020 

ttgtgggtga tgtggttgga ttgggctgtg ttaaattggt ttgggtcagg ttttggttga 10080 

ggttatcatg gggatgagga tatgcttggg acatggattc aggtggttct cattcaagct 10140

gaggcaaatt tcctttcaga cggtcattcc agggaacgag tggttgtgtg ggggaaatca 10200 

ggccactggc tgtgaatatc cctctatcct ggtcttgaat tgtgattatc tatgtccatt 10260 

ctgtctcctt cactgtactt ggaattgatc tggtcattca gctggaaatg ggggaagatt 10320 

ttgtcaaatt cttgagacac agctgggtct ggatcagcgt aagccttcct tctggtttta 10380

ttgaacagat gaaatcacat tttttttttc aaaatcacag aaatcttata gagttaacag 10440 

tggactctta taataagagt taacaccagg actcttattc ttgattcttt tctgagacac 10500

caaaatgaga tttctcaatg ccaccctaat tctttttttt tttttttttt tttttgagac 10560 

acagtctggg tcttttgctc tgtcactcag gctggagcgc agtggtgtga tcatagctca 10620 

ctgaaccctt gacctcctgg acttaaggga tcctcctgct tcagcctcct gagtagatgg 10680 

ggctacaggt gcttgccacc acacctggct aattaaattt tttttttttt tttgtagaga 10740 

aagggtctca ctttgttgcc ctggctgatc ttgaacttct gacttcaagt gattcttcag 10800 

ccttggactc ccaaagcact gggattgctg gcatgagcca ctcaccgtgc ctggcttgca 10860 

gcttaatctt ggagtgtata aacctggctc ctgatagcta gacatttcag tgagaaggag 10920 

gcattggatt ttgcatgagg acaattctga cctaggaggg caggtcaaca ggaatccccg 10980 

ctgtacctgt acgttgtaca ggcatggaga atgaggagtg aggaggccgt accggaaccc 11040 

catattgttt agtggacatt ggattttgaa ataataggga acttggtctg ggagagtcat 11100

atttctggat tggacaatat gtggtatcac aaggttttat gatgagggag aaatgtatgt 11160 

ggggaaccat tttctgagtg tggaagtgca agaatcagag agtagctgaa tgccaacgct 11220

tctatttcag gaacatggta agttggaggt ccagctctcg ggctcagacg ggtataggga 11280 

ccaggaagtc tcacaatccg atcattctga tatttcaggg catattaggt ttggggtgca 11340 

aaggaagtac ttgggactta ggcacatgag actttgtatt gaaaatcaat gattggggct 11400

ggccgtggtg ctcacgcctg taatctcatc actttgggag accgaagtgg gaggatggct 11460 

tgatctcaag agttggacac cagcctaggc aacatggcca gaccctctct ctacaaaaaa 11520 

attaaaaatt agctggatgt ggtggtgcat gcttgtggtc tcagctatcc tggaggctga 11580 

gacaggagaa tcggttgagt ctgggagttc aaggctacag ggagctgcga tcacgccgct 11640

gcactccagc ctgggaaaca gagtgagact gtctcagaat ttttttaaaa aagaatcagt 11700 

gatcatccca acccctgttg ctgttcatcc tgagcctgcc ttctctggct ttgttcccta 11760 

gatcacatct ccatgatcca taggccctgc ccaatctgac ctcacaccgt gggaatgcct 11820

ccagactgat ctagtatgtg tggaacagca agtgctggct ctccctcccc ttccacagct 11880 

ctgggtgtgg gagggggttg tccagcctcc agcagcatgg ggagggcctt ggtcagcatc 11940 

taggtgccaa cagggcaagg gcggggtcct ggagaatgaa ggctttatag ggctcctcag 12000 

ggaggccccc cagccccaaa ctgcaccacc tggccgtgga caccggt 12047 

<210> 12 
<211> 67 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> HRE-TRE 
<400> 12

ccccgaggca gtgcatgagg ctcagggcgt gcgtgagtcg cagcgagacc ccggggtgca 60 
ggccgga 67 

<210> 13 
<211> 5835 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PSA-TRE 



<400> 13 

aagcttctag ttttcttttc ccggtgacat cgtggaaagc actagcatct ctaagcaatg 60
atctgtgaca atattcacag tgtaatgcca tccagggaac tcaactgagc cttgatgtcc 120
agagattttt gtgttttttt ctgagactga gtctcgctct gtgccaggct ggagtgcagt 180
ggtgcaacct tggctcactg caagctccgc ctcctgggtt cacgccattc tcctgcctca 240
gcctcctgag tagctgggac tacaggcacc cgccaccacg cctggctaat ttttttgtat 300
ttttagtaga gatggggttt cactgtgtta gccaggatgg tctcagtctc ctgacctcgt 360
gatctgccca ccttggcctc ccaaagtgct gggatgacag gcgtgagcca ccgcgcctgg 420
ccgatatcca gagatttttt ggggggctcc atcacacaga catgttgact gtcttcatgg 480
ttgactttta gtatccagcc cctctagaaa tctagctgat atagtgtggc tcaaaacctt 540
cagcacaaat cacaccgtta gactatctgg tgtggcccaa accttcaggt gaacaaaggg 600
actctaatct ggcaggatac tccaaagcat tagagatgac ctcttgcaaa gaaaaagaaa 660
tggaaaagaa aaagaaagaa aggaaaaaaa aaaaaaaaaa gagatgacct ctcaggctct 720
gaggggaaac gcctgaggtc tttgagcaag gtcagtcctc tgttgcacag tctccctcac 780
agggtcattg tgacgatcaa atgtggtcac gtgtatgagg caccagcaca tgcctggctc 840
tggggagtgc cgtgtaagtg tatgcttgca ctgctgaatg gctgggatgt gtcagggatt 900
atcttcagca cttacagatg ctcatctcat cctcacagca tcactatggg atgggtatta 960
ctggcctcat ttgatggaga aagtggctgt ggctcagaaa ggggggacca ctagaccagg 1020
gacactctgg atgctgggga ctccagagac catgaccact caccaactgc agagaaatta 1080
attgtggcct gatgtccctg tcctggagag ggtggaggtg gaccttcact aacctcctac 1140
cttgaccctc tcttttaggg ctctttctga cctccaccat ggtactagga ccccattgta 1200
ttctgtaccc tcttgactct atgaccccca ccgcccactg catccagctg ggtcccctcc 1260
tatctctatt cccagctggc cagtgcagtc tcagtgccca cctgtttgtc agtaactctg 1320
aaggggctga cattttactg acttgcaaac aaataagcta actttccaga gttttgtgaa 1380
tgctggcaga gtccatgaga ctcctgagtc agaggcaaag gcttttactg ctcacagctt 1440
agcagacagc atgaggttca tgttcacatt agtacacctt gcccccccca aatcttgtag 1500
ggtgaccaga gcagtctagg tggatgctgt gcagaagggg tttgtgccac tggtgagaaa 1560
cctgagatta ggaatcctca atcttatact gggacaactt gcaaacctgc tcagcctttg 1620
tctctgatga agatattatc ttcatgatct tggattgaaa acagacctac tctggaggaa 1680
catattgtat cgattgtcct tgacagtaaa caaatctgtt gtaagagaca ttatctttat 1740
tatctaggac agtaagcaag cctggatctg agagagatat catcttgcaa ggatgcctgc 1800
tttacaaaca tccttgaaac aacaatccag aaaaaaaaag gtgttactgt ctttgctcag 1860
aagacacaca gatacgtgac agaaccatgg agaattgcct cccaacgctg ttcagccaga 1920
gccttccacc ctttctgcag gacagtctca acgttccacc attaaatact tcttctatca 1980
catcccgctt ctttatgcct aaccaaggtt ctaggtcccg atcgactgtg tctggcagca 2040
ctccactgcc aaacccagaa taaggcagcg ctcaggatcG cgaaggggca tggctgggga 2100
tcagaacttc tgggtttgag tgaggagtgg gtccaccctc ttgaatttca aaggaggaag 2160
«^Srsrctggatg tgaaggtact gggggaggga aagtgtcagt tccgaactct taggtcaatg 2220
agggaggaga ctggtaaggt cccagctccc gaggtactga tgtgggaatg gcctaagaat 2280
ctcatatcct caggaagaag gtgctggaat cctgaggggt agagttctgg gtatatttgt 2340
ggcttaaggc tctttggccc ctgaaggcag aggctggaac cattaggtcc agggtttggg 2400
gtgatagtaa tgggatctct tgattcctca agagtctgag gatcgagggt tgcccattct 2460
tccatcttgc cacctaatcc ttactccact tgagggtatc accagccctt ctagctccat 2520
gaaggtcccc tgggcaagca caatctgagc atgaaagatg ccccagaggc cttgggtgtc 2580
atccactcat catccagcat cacactctga gggtgtggcc agcaccatga cgtcatgttg 2640

ctgtgactat ccctgcagcg tgcctctccA gccacctgcc aaccgtagag ctgcGcatcc 2700 

tcctctggtg ggagtggcct gcatggtgcc aggctgaggc ctagtgtcag acagggagcc 2760 

tggaatcata gggatccagg actcaaaagt gctagagaat ggccatatgt caccatccat 2820

gaaatctcaa gggcttctgg gtggagggca cagggacctg aacttatggt ttcccaagtc 2880 

tattgctctc ccaagtgagt ctcccagata cgaggcactg tgccagcatc agccttatct 2940

ccaccacatc ttgtaaaagg actacccagg gccctgatga acaccatggt gtgtacagga 3000 

gtagggggtg gaggcacgga ctcctgtgag gtcacagcca agggagcatc atcatgggtg 3060 

gggaggaggc aatggacagg cttgagaacg gggatgtggt tgtatttggt tttctttggt 3120 

tagataaagt gctgggtata ggattgagag tggagtatga agaccagtta ggatggagga 3180

tcagattgga gttgggttag ataaagtgct gggtatagga ttgagagtgg agtatgaaga 3240
ccagttagga tggaggatca gattggagtt gggttagaga tggggtaaaa ttgtgctccg 3300

gatgagtttg ggattgacac tgtggaggtg gtttgggatg gcatggcttt gggatggaaa 3360 

tagatttgtt ttgatgttgg ctcagacatc cttggggatt gaactgggga tgaagctggg 3420 

tttgattttg gaggtagaag acgtggaagt agctgtcaga tttgacagtg gccatgagtt 3480

ttgtttgatg gggaatcaaa caatggggga agacataagg gttggcttgt taggttaagt 3540 

tgcgttgggt tgatggggtc ggggctgtgt ataatgcagt tggattggtt tgtattaaat 3600 

tgggttgggt caggttttgg ttgaggatga gttgaggata tgcttgggga caccggatcc 3660 

atgaggttct cactggagtg gagacaaact tcctttccag gatgaatcca gggaagcctt 3720 

aattcacgtg taggggaggt caggccactg gctaagtata tccttccact ccagctctaa 3780 
gatggtctta aattgtgatt atctatatcc acttctgtct ccctcactgt gcttggagtt 3840

tacctgatca ctcaactaga aacaggggaa gattttatca aattcttttt tttttttttt 3900

tttttttgag acagagtctc actctgttgc ccaggctgga gtgcagtggc gcagtctcgg 3960

ctcactgcaa cctctgcctc ccaggttcaa gtgattctcc tgcctcagcc tcctgagttg 4020 

ctgggattac aggcatgcag caccatgccc agctaatttt tgtattttta gtagagatgg 4080 

ggtttcacca atgtttgcca ggctggcctc gaactcctga cctggtgatc cacctgcctc 4140

agcctcccaa agtgctggga ttacaggcgt cagccaccgc gcccagccac ttttgtcaaa 4200

ttcttgagac acagctcggg ctggatcaag tgagctactc tggttttatt gaacagctga 4260

aataaccaac tttttggaaa ttgatgaaat cttacggagt taacagtgga ggtaccaggg 4320 

ctcttaagag ttcccgattc tcttctgaga ctacaaattg tgattttgca tgccacctta 4380 

atcttttttt tttttttttt aaatcgaggt ttcagtctca ttctatttcc caggctggag 4440

ttcaatagcg tgatcacagc tcactgtagc cttgaactcc tggccttaag agattctcct 4500

gcttcggtct cccaatagct aagactacag tagtccacca ccatatccag ataattttta 4560 

aattttttgg ggggccgggc acagtggctc acgcctgtaa tcccaacacc atgggaggct 4620 

gagatgggtg gatcacgagg tcaggagttt gagaccagcc tgaccaacat ggtgaaactc 4680 

tgtctctact aaaaaaaaaa aaaatagaaa aattagccgg gcgtggtggc acacggcacc 4740 

tgtaatccca gctactgagg aggctgaggc aggagaatca cttgaaccca gaaggcagag 4800 

gttgcaatga gccgagattg cgccactgca ctccagcctg ggtgacagag tgagactctg 4860

tctcaaaaaa aaaaaatttt tttttttttt ttgtagagat ggatcttgct ttgtttctct 4920

ggttggcctt gaactcctgg cttcaagtga tcctcctacc ttggcctcgg aaagtgttgg 4980 

gattacaggc gtgagccacc atgactgacc tgtcgttaat cttgaggtac ataaacctgg 5040 

ctcctaaagg ctaaaggcta aatatttgtt ggagaagggg cattggattt tgcatgagga 5100 

tgattctgac ctgggagggc aggtcagcag gcatctctgt tgcacagata gagtgtacag 5160 

gtctggagaa caaggagtgg ggggttattg gaattccaca ttgtttgctg cacgttggat 5220

tttgaaatgc tagggaactt tgggagactc atatttctgg gctagaggat ctgtggacca 5280 

caagatcttt ttatgatgac agtagcaatg tatctgtgga gctggattct gggttgggag 5340 

tgcaaggaaa agaatgtact aaatgccaag acatctattt caggagcatg aggaataaaa 5400

gttctagttt ctggtctcag agtggtgcat ggatcaggga gtctcacaat ctcctgagtg 5460

ctggtgtctt agggcacact gggtcttgga gtgcaaagga tctaggcacg tgaggctttg 5520 

tatgaagaat cggggatcgt acccaccccc tgtttctgtt tcatcctggg catgtctcct 5580 

ctgcctttgt cccctagatg aagtctccat gagctacaag ggcctggtgc atccagggtg 5640 

atctagtaat tgcagaacag caagtgctag ctctccctcc ccttccacag ctctgggtgt 5700 

gggagggggt tgtccagcct ccagcagcat ggggagggcc ttggtcagcc tctgggtgcc 5760 

agcagggcag gggcggagtc ctggggaatg aaggttttat agggctcctg ggggaggctc 5820

cccagcccca agctt 5835



<210> 14 



<211> 15056 
<212> DNA

<213> Artificial Sequence 

<220> 

<223> CEA TRE 



<400> 14 

aagcttttta gtgctttaga cagtgagctg gtctgtctaa cccaagtgac ctgggctcca 60 

tactcagccc cagaagtgaa gggtgaagct gggtggagcc aaaccaggca agcctaccct 120

cagggctccc agtggcctga gaaccattgg acccaggacc cattacttct agggtaagga 180 

aggtacaaac accagatcca accatggtct ggggggacag ctgtcaaatg cctaaaaata 240 

tacctgggag aggagcaggc aaactatcac tgccccaggt tctctgaaca gaaacagagg 300 

ggcaacccaa agtccaaatc caggtgagca ggtgcaccaa atgcccagag atatgacgag 360 

gcaagaagtg aaggaaccac ccctgcatca aatgttttgc atgggaagga gaagggggtt 420

gctcatgttc ccaatccagg agaatgcatt tgggatctgc cttcttctca ctccttggtt 480 

agcaagacta agcaaccagg actctggatt tggggaaaga cgtttatttg tggaggccag 540

tgatgacaat cccacgaggg cctaggtgaa gagggcagga aggctcgaga cactggggac 600

tgagtgaaaa ccacacccat gatctgcacc acccatggat gctccttcat tgctcacctt 660

tctgttgata tcagatggcc ccattttctg taccttcaca gaaggacaca ggctagggtc 720 

tgtgcatggc cttcatcccc ggggccatgt gaggacagca ggtgggaaag atcatgggtc 780 

ctcctgggtc ctgcagggcc agaacattca tcacccatac tgacctccta gatgggaatg 840 

gcttccctgg ggctgggcca acggggcctg ggcaggggag aaaggacgtc aggggacagg 900 

gaggaagggt catcgagacc cagcctggaa ggttcttgtc tctgaccatc caggatttac 960 

ttccctgcat ctacctttgg tcattttccc tcagcaatga ccagctctgc ttcctgatct 1020

cagcctccca ccctggacac agcaccccag tccctggccc ggctgcatcc acccaatacc 1080 

ctgataaccc aggacccatt acttctaggg taaggagggt ccaggagaca gaagctgagg 1140 

aaaggtctga agaagtcaca tctgtcctgg ccagagggga aaaaccatca gatgctgaac 1200 

caggagaatg ttgacccagg aaagggaccg aggacccaag aaaggagtca gaccaccagg 1260 

gtttgcctga gaggaaggat caaggccccg agggaaagca gggctggctg catgtgcagg 1320 

acactggtgg ggcatatgtg tcttagattc tccctgaatt cagtgtccct gccatggcca 1380 

gactctctac tcaggcctgg acatgctgaa ataggacaat ggccttgtcc tctctcccca 1440

ccatttggca agagacataa aggacattcc aggacatgcc ttcctgggag gtccaggttc 1500 

tctgtctcac acctcaggga ctgtagttac tgcatcagcc atggtaggtg ctgatctcac 1560 

ccagcctgtc caggcccttc cactctccac tttgtgacca tgtccaggac cacccctcag 1620

atcctgagcc tgcaaatacc cccttgctgg gtgggtggat tcagtaaaca gtgagctcct 1680 

atccagcccc cagagccacc tctgtcacct tcctgctggg catcatccca ccttcacaag 1740 

cactaaagag catggggaga cctggctagc tgggtttctg catcacaaag aaaataatcc 1800 

cccaggttcg gattcccagg gctctgtatg tggagctgac agacctgagg ccaggagata 1860 

gcagaggtca gccctaggga gggtgggtca tccacccagg ggacaggggt gcaccagcct 1920 

tgctactgaa agggcctccc caggacagcg ccatcagccc tgcctgagag ctttgctaaa 1980 

cagcagtcag aggaggccat ggcagtggct gagctcctgc tccaggcccc aacagaccag 2040 

accaacagca caatgcagtc cttccccaac gtcacaggtc accaaaggga aactgaggtg 2100 

ctacctaacc ttagagccat caggggagat aacagcccaa tttcccaaac aggccagttt 2160 

caatcccatg acaatgacct ctctgctctc attcttccca aaataggacg ctgattctcc 2220 

cccaccatgg atttctccct tgtcccggga gccttttctg ccccctatga tctgggcact 2280

cctgacacac acctcctctc tggtgacata tcagggtccc tcactgtcaa gcagtccaga 2340 

aaggacagaa ccttggacag cgcccatctc agcttcaccc ttcctccttc acagggttca 2400 

gggcaaagaa taaatggcag aggccagtga gcccagagat ggtgacaggc agtgacccag 2460 

gggcagatgc ctggagcagg agctggcggg gccacaggga gaaggtgatg caggaaggga 2520 

aacccagaaa tgggcaggaa aggaggacac aggctctgtg gggctgcagc ccagggttgg 2580 

actatgagtg tgaagccatc tcagcaagta aggccaggtc ccatgaacaa gagtgggagc 2640 

acgtggcttc ctgctctgta tatggggtgg gggattccat gccccataga accagatggc 2700 

cggggttcag atggagaagg agcaggacag gggatcccca ggataggagg accccagtgt 2760

ccccacccag gcaggtgact gatgaatggg catgcagggt cctcctgggc tgggctctcc 2820 

ctttgtccct caggattcct tgaaggaaca tccggaagcc gaccacatct acctggtggg 2880 

ttctggggag tccatgtaaa gccaggagct tgtgttgcta ggaggggtca tggcatgtgc 2940 
tgggggcacc aaagagagaa acctgagggc aggcaggacc tggtctgagg aggcatggga 3000

gcccagatgg ggagatggat gtcaggaaag gctgccccat cagggagggt gatagcaatg 3060

gggggtctgt gggagtgggc acgtgggatt ccctgggctc tgccaagttc cctcccatag 3120

tcacaacctg gggacactgc ccatgaaggg gcgcctttgc ccagccagat gctgctggtt 3180 

ctgcccatcc actaccctct ctgctccagc cactctgggt ctttctccag atgccctgga 3240

cagccctggc ctgggcctgt cccctgagag gtgttgggag aagctgagtc tctggggaca 3300

ctctcatcag agtctgaaag gcacatcagg aaacatccct ggtctccagg actaggcaat 3360 

gaggaaaggg ccccagctcc tccctttgcc actgagaggg tcgaccctgg gtggccacag 3420 

tgacttctgc gtctgtccca gtcaccctga aaccacaaca aaaccccagc cccagaccct 3480 

gcaggtacaa tacatgtggg gacagtctgt acccagggga agccagttct ctcttcctag 3540

gagaccgggc ctcagggctg tgcccggggc aggcgggggc agcacgtgcc tgtccttgag 3600 

aactcgggac cttaagggtc tctgctctgt gaggcacagc aaggatcctt ctgtccagag 3660 

atgaaagcag ctcctgcccc tcctctgacc tcttcctcct tcccaaatct caaccaacaa 3720 

ataggtgttt caaatctcat catcaaatct tcatccatcc acatgagaaa gcttaaaacc 3780 

caatggattg acaacatcaa gagttggaac aagtggacat ggagatgtta cttgtggaaa 3840

tttagatgtg ttcagctatc gggcaggaga atctgtgtca aattccagca tggttcagaa 3900

gaatcaaaaa gtgtcacagt ccaaatgtgc aacagtgcag gggataaaac tgtggtgcat 3960 

tcaaactgag ggatattttg gaacatgaga aaggaaggga ttgctgctgc acagaacatg 4020 

gatgatctca cacatagagt tgaaagaaag gagtcaatcg cagaatagaa aatgatcact 4080 

aattccacct ctataaagtt tccaagagga aaacccaatt ctgctgctag agatcagaat 4140 

ggaggtgacc tgtgccttgc aatggctgtg agggtcacgg gagtgtcact tagtgcaggc 4200 
aatgtgccgt atcttaatct gggcagggct ttcatgagca cataggaatg cagacattac 4260 

tgctgtgttc attttacttc accggaaaag aagaataaaa tcagccgggc gcggtggctc 4320 

acgcctgtaa tcccagcact ttagaaggct gaggtgggca gattacttga ggtcaggagt 4380 

tcaagaccac cctggccaat atggtgaaac cccggctcta ctaaaaatac aaaaattagc 4440 

tgggcatggt ggtgcgcgcc tgtaatccca gctactcggg aggctgaggc tggacaattg 4500 

cttggaccca ggaagcagag gttgcagtga gccaagattg tgccactgca ctccagcttg 4560 

ggcaacagag ccagactctg taaaaaaaaa aaaaaaaaaa aaaaaaagaa agaaagaaaa 4620 

agaaaagaaa gtataaaatc tctttgggtt aacaaaaaaa gatccacaaa acaaacacca 4680 

gctcttatca aacttacaca actctgccag agaacaggaa acacaaatac tcattaactc 4740 

acttttgtgg caataaaacc ttcatgtcaa aaggagacca ggacacaatg aggaagtaaa 4800 

actgcaggcc ctacttgggt gcagagaggg aaaatccaca aataaaacat taccagaagg 4860

agctaagatt tactgcattg agttcattcc ccaggtatgc aaggtgattt taacacctga 4920 

aaatcaatca ttgcctttac tacatagaca gattagctag aaaaaaatta caactagcag 4980 

aacagaagca atttggcctt cctaaaattc cacatcatat catcatgatg gagacagtgc 5040 

agacgccaat gacaataaaa agagggacct ccgtcacccg gtaaacatgt ccacacagct 5100 

ccagcaagca cccgtcttcc cagtgaatca ctgtaacctc ccctttaatc agccccaggc 5160

aaggctgcct gcgatggcca cacaggctcc aacccgtggg cctcaacctc ccgcagaggc 5220

tctcctttgg ccaccccatg gggagagcat gaggacaggg cagagccctc tgatgcccac 5280 

acatggcagg agctgacgcc agagccatgg gggctggaga gcagagctgc tggggtcaga 5340 

gcttcctgag gacacccagg cctaagggaa ggcagctccc tggatggggg caaccaggct 5400 

ccgggctcca acctcagagc ccgcatggga ggagccagca ctctaggcct ttcctagggt 5460 

gactctgagg ggaccctgac acgacaggat cgctgaatgc acccgagatg aaggggccac 5520 

cacgggaccc tgctctcgtg gcagatcagg agagagtggg acaccatgcc aggcccccat 5580

ggcatggctg cgactgaccc aggccactcc cctgcatgca tcagcctcgg taagtcacat 5640 

gaccaagccc aggaccaatg tggaaggaag gaaacagcat cccctttagt gatggaaccc 5700 

aaggtcagtg caaagagagg ccatgagcag ttaggaaggg tggtccaacc tacagcacaa 5760 

accatcatct atcataagta gaagccctgc tccatgaccc ctgcatttaa ataaacgttt 5820 

gttaaatgag tcaaattccc tcaccatgag agctcacctg tgtgtaggcc catcacacac 5880 

acaaacacac acacacacac acacacacac acacacacac acagggaaag tgcaggatcc 5940

tggacagcac caggcaggct tcacaggcag agcaaacagc gtgaatgacc catgcagtgc 6000 

cctgggcccc atcagctcag agaccctgtg agggctgaga tggggctagg caggggagag 6060 

acttagagag ggtggggcct ccagggaggg ggctgcaggg agctgggtac tgccctccag 6120 

ggagggggct gcagggagct gggtactgcc ctccagggag ggggctgcag ggagctgggt 6180

actgccctcc agggaggggg ctgcagggag ctgggtactg ccctccaggg agggggctgc 6240 

agggagctgg gtactgccct ccagggaggc aggagcactg ttcccaacag agagcacatc 6300

ttcctgcagc agctgcacag acacaggagc ccccatgact gccctgggcc agggtgtgga 6360 
^tccaaattt cgtgccccat tgggtgggac ggaggttgac cgtgacatcc aaggggcatG 6420
tgtgattcca aacttaaact actgtgccta caaaatagga aataacGcta ctttttctac 6480
tatctcaaat tccctaagca caagctagca ccctttaaat caggaagttG agtcactGGt 6540
ggggtcctcc catgccccca gtctgacttg caggtgcaca gggtggctga GatGtgtGGt 6600
tgctcctcct cttggctcaa ctgccgcGcc tcctgggggt gactgatggt caggacaagg 6660
gatcctagag ctggccccat gattgacagg aaggcaggac ttggcctcca ttGtgaagac 6720
taggggtgtc aagagagctg ggcatcccac agagctgcac aagatgacgc ggacagaggg 6780
tgacacaggg ctcagggctt cagacgggtc gggaggctca gctgagagtt cagggacaga 6840
cctgaggagc ctcagtggga aaagaagcac tgaagtggga agttctggaa tgttctggac 6900
aagcctgagt gctctaagga aatgctccca ccccgatgta gcctgcagca GtggacggtG 6960
tgtgtacctc cccgctgccc atcctctcac agcccccgcc tctagggaca GaactGGtgG 7020
cctaacatgc atctttcctg tctcattcca cacaaaaggg cctctggggt ccctgttGtg 7080
cattgcaagg agtggaggtc acgttcccac agaccaccca gcaacagggt GGtatggagg 7140
tgcggtcagg aggatcacac gtccccccat gcccagggga ctgactctgg gggtgatgga 7200
ttggcctgga ggccactggt cccctctgtc cctgagggga atctgcaccc tggaggctgc 7260
cacatccctc ctgattcttt cagctgaggg cccttcttga aatcccaggg aggactcaac 7320
ccccactggg aaaggcccag tgtggacggt tccacagcag cccagctaag gcGGttggac 7380
acagatcctg agtgagagaa cctttaggga cacaggtgca cggccatgtc cccagtgccc 7440
acacagagca ggggcatctg gaccctgagt gtgtagctcc cgcgactgaa CGcagcGCtt 7500
ccccaatgac gtgacccctg gggtggctcc aggtctccag tccatgccac GaaaatctGG 7560
agattgaggg tcctcccttg agtccctgat gcctgtccag gagctgcccc Gtgagcaaat 7620
ctagagtgca gagggctggg attgtggcag taaaagcagc Gacatttgtc tcaggaagga 7680
aagggaggac atgagctcca ggaagggcga tggcgtcctc tagtgggcgc ctcctgttaa 7740
tgagcaaaaa ggggccagga gagttgagag atcagggctg gccttggact aaggctGaga 7800
tggagaggac tgaggtgcaa agagggggct gaagtagggg agtggtcggg agagatggga 7860
ggagcaggta aggggaagcc ccagggaggc cgggggaggg tacagcagag GtGtGGaCtG 7920
ctcagcattg acatttgggg tggtcgtgct agtggggttc tgtaagttgt agggtgttca 7980
gcaccatctg gggactctac ccactaaatg ccagcaggac tccctcccca agctctaaca 8040
accaacaatg tctccagact ttccaaatgt cccctggaga gcaaaattgc ttctggcaga 8100
atcactgatc tacgtcagtc tctaaaagtg actcatcagc gaaatccttc acctcttggg 8160
agaagaatca caagtgtgag aggggtagaa actgcagact tcaaaatctt tccaaaagag 8220
ttttacttaa tcagcagttt gatgtcccag gagaagatac atttagagtg tttagagttg 8280
atgccacatg gctgcctgta cctcacagca ggagcagagt gggttttcca agggcctgta 8340
accacaactg gaatgacact cactgggtta cattacaaag tggaatgtgg ggaattctgt 8400
agactttggg aagggaaatg tatgacgtga gcccacagcc taaggcagtg gacagtccac 8460
tttgaggctc tcaccatcta ggagacatct cagccatgaa catagccaca tctgtcatta 8520
gaaaacatgt tttattaaga ggaaaaatct aggctagaag tgctttatgc tGttttttGt 8580
ctttatgttc aaattcatat acttttagat cattccttaa agaagaatct atGGGGctaa 8640
gtaaatgtta tcactgactg gatagtgttg gtgtctcact cccaacccct gtgtggtgac 8700
agtgccctgc ttccccagcc ctgggccctc tctgattcct gagagctttg ggtgctcGtt 8760
cattaggagg aagagaggaa gggtgttttt aatattctca ccattcaccc atccacctct 8820
tagacactgg gaagaatcag ttgcccactc ttggatttga tcctcgaatt aatgacctct 8880
atttctgtcc cttgtccatt tcaacaatgt gacaggccta agaggtgcct tctccatgtg 8940
atttttgagg agaaggttct caagataagt tttctcacac ctctttgaat tacctGcaGG 9000
tgtgtcccca tcaccattac cagcagcatt tggacccttt ttctgttagt cagatgcttt 9060
ccacctcttg agggtgtata ctgtatgctc tctacacagg aatatgcaga ggaaatagaa 9120
aaagggaaat cgcattacta ttcagagaga agaagacctt tatgtgaatg aatgagagtc 9180
taaaatccta agagagccca tataaaatta ttaccagtgc taaaactaca aaagttacac 9240
taacagtaaa ctagaataat aaaacatgca tcacagttgc tggtaaagct aaatcagata 9300
tttttttctt agaaaaagca ttccatgtgt gttgcagtga tgacaggagt gcGGttcagt 9360
caatatgctg cctgtaattt ttgttccctg gcagaatgta ttgtcttttc tGCGtttaaa 9420
tcttaaatgc aaaactaaag gcagctcctg ggccccctcc ccaaagtcag Gtgcctgcaa 9480
ccagccccac gaagagcaga ggcctgagct tccctggtca aaataggggg ctagggagct 9540
taaccttgct cgataaagct gtgttcccag aatgtcgctc Gtgttcccag gggcaccagc 9600
ctggagggtg gtgagcctca ctggtggcct gatgcttacc ttgtgcGGtc acaccagtgg 9660
tcactggaac cttgaacact tggctgtcgc ccggatctgc agatgtcaag aacttctgga 9720
agtcaaatta ctgcccactt ctccagggca gatacctgtg aacatGcaaa accatgccac 9780
agaaccctgc
ctgtgaccac 
agcgaggctt 
cctggtagtg 
cccttctgca 
agacccctgc 
ggagccgctc 
acaccagtgt 
cctgtgaaca 
cctaacgtgc 
gctgggaagc 
gggcaccatt 
cctaggtgcc 
cagtgagGcg 
aaaaaaaaag 
gagttaatga 
aaatatagtg 
ttaacatgat 
ggagagggaa 
ccaggtacag 
tcccagccaa 

gggggcacaa 

tgacccggcc 
gtccccaggg 

cacacataga 
caccagagaa 
gcagggttcc 
tccaggcatc 
cctcactcaa 
tcctccctca 
tgagcccaga 
tgaagccaca 
agaggaggac 
tcagcaggtg 
gacacacaga 
ggagtgagaa 
tcctcatgcc 
caaattgtaa 
tggattttct 
aagtgtgtag 
acgttcctgt 
agcagaagcc 
ctttataaat 
atcttctact 
tgcacagcgc 
ttggggacat 
ctcaccgagc 
tctagagatc 
tcctgtccac 
acctccctca 
cacgtcctcc 
atcggccgcc 
aggcagctcc 
agggcagctc 
tgct-gtcacc 
gcgcctgtct 
gtggaatccc 



ctggggtcta 
ctgccaagat 
ggaggtgctt 
gtgaggagcc 
tccccagcag 
accaatgaga 
ttccttgagg 
ctaagcccct 
gaatcctcct 
agccatgccc 
tgggagaaga 
tatgcctagg 
tctgggaggc 
agatcacacc 
agaaagatag 
ttaatagtat 
gtgaggatgt 
taatgtggca 
agcccaccat 
gcccctcatg 
ctgcaggtgc 
acctcagcac 
cactgtccag 
cactctgtgt 
tctgaccatt 
tcaggtacag 
tgcagtccac 
cctgacacag 
gctgtccaga 
cagggctcag 
gatggtgaca 
gatgggagaa 
acaggctctg 
aggccaggtc 
gtgtgtggtg 
gtgaggttga 
cctctctctc 
accccaatgt 
gctfctgatgc 
cacctctccc 
ttccccttca 
actatgcttc 
tacccagtct 
cccagatccc 
agcctcctgc 
ctgattgtga 
tgaaacctgg 
agtgttgagt 
cagagggcgg 
gccgggccca 
tcgccctccg 
cacctgaggc 
tgtcccctac 
cctgtgatct 
tcacagccct 
cccggttaca 
agggcccagc 



caacacatat 
gcccctaact 
cgccacaagg 
ttgggaccct 
gtgccccgct 
ccatgctgaa 
gccagggcag 
gatgagaaca 
tctgcagtga 
atctacccac 
catggaatac 
acacccacct 
cgaggcagga 
actgcactcc 
catcagtggc 
gaagtttcta 
agaatattgt 
tattttatct 
gtaaaataca 
acctgcacag 
acaacataaa 
tgccaggaca 
atcttgttgt 
gagcacacga 
aggcattgtg 
agaggaagac 
aggtccagct 
ctccctcccg 
gagggcagca 
ggctcagggc 
gggcaatgat 
gatggttcag 

tggggctgca 
ccat^gaacag 
ctgtgccccc 
cttgtccctg 
aaatatgatt 
tggaggtggg 
tgtttctgtg 
ctctctctct 
ccgtccagaa 
ctgtacaact 
caggtatttc 
cgcacacgct 
cgacaaaagc 
aagagggagg 
tagcactttg 
aaaacagcct 
cagagttcct 
cagcccagca 
agctcctcac 
ttgtcggccg 
accccctcct 
ccaggacagc 
gtcattacca 
ggaaaactct 
ccagtgcctg 



ggactgtgag 
gggatGcacc 
cagccGcaat: 
caggattact 
cttcagagcc 
gcctcagaga 
ggaaagcagg 

gggtggtctc 

caatgtctga 
tgccfcactgc 
ccatggcttc 
gccggcccca 
gaattgcttg 
agcctgggtg 
taccaagggc 
tgtgagatga 
gaatataatt 
tatgtatttg 
cccaccctaa 
gaataactaa 
tgtatctgca 
GacacGcttc 
gggattggga 
gacGtGGCca 
aggaggaGtG 
ggggGtcgag 
GaGGCtggtg 
gagGGtGGtG 
GCttggaGag 
tcaagtctGa 
GG aggggGag 
gaagaaaaa t: 
gGGGaggatg 
agaagcagct 
agagtcgggG 

GtGGtCtGtg 

tggatGtatg 
gGGttgtgag 
atagagatGt 

GtGtGtGtta 

tgattgtaag 

gcagaatgat 
tttatagcaa 
tagccccaga 
aaagtGaGaa 
aGagtaGaGt 
gcataacatg 
ggtctggggc 
cccaccctgg 
gggtGGaGGG 
acggaGtGtg 
GGGacttgag 
tccccgggct 
tcagtctctc 
ttaactcctc 
gtgaGaggga 
aGaGggaaGa 



GaGGaagtcc 
aatcactgGa 
ttgctgggag 
GGCGttaagG 
tGtGtctctg 
gagagatgga 
aggGagGaGG 
tcccatatgc 
gaggacgaca 
aggacagcac 
tGaGGttcct 
ggGtGttaag 
aacccgggag 
acagaatgag 
taggggcagg 
tgaaaatgtt 
aaGggGattt 
aGtaGatGGa 
tGagatagtG 
ggatttaagg 
aaGagaGtga 
tGgtggattG 
Gaagggaggt 

CCCGGCCaGG 

tagGgGgggG 
gagGtgatgg 
taggtgGGGG 
GGaggtgaGa 
cgcccaGCCC 
gaaGaaatgg 
GtgGGtgaaa 
ccaggaatgg 
ggaGtaagtg 

CGCaGGtGGG 
tGtGGtgttG 

GtaGGGGaaG 
tcGGGgccca 
aagtgattgg 
cacatgatct 
GtGatgGtGt 
ttttGtgagg 
gagcgaatta 
tgcgaggaca 
catcactgcc 
aaggtgaGaa 
tgtagGGaca 
tgGatgaGGG 
GgGtgGtgtG 
agcctGCCca 
tcaGGGgggt 

tGagGtGGtG 

gcctgtcggc 
cagctgaaag 
acaggctccg 
agtccGatga 
GGaGgtGtgt 
gatgGtGGat 



agGGGtgaat 


9840 


catcrcrGaQQG - 


9900 


tttGttggca 


9960 


a t acr tQcraqra 


10020 


aggtttaccc 


10080 


gGtttgacGa 


10140 


aaciacTtQQcra 


10200 


ccataGGagg 


10260 


tgtttGGGag 


10320 


caacccagga 


10380 


GcagtcGagt 


10440 


acrt t acrcitGa 


10500 


crcaciaQcrttQ 


10560 


actctgtGtG 


10620 


QQ a acrcr t crcra 


10680 


G tggaaaaaa 


10740 


aat.t:gt:aGaG 


10800 


agaaaGactig 


10860 


GtGattgtaG 


10920 


aGatgaggct 


10980 


gagt aaagc t 


11040 


tgaGtttatc 


11100 


Gataaacrcct 



11160 


QttacrcrtGtG 


11220 


tGagggatGa 


11280 


atgaGaGaga 


11340 


atGGGGGtga 


11400 


ca ti caodci ti c 


11460 


aGt^t^caG^Gt^ 


11520 


ct.^^ ^ ^ y 


11580 


Gcrcrcracrc aocf 


11640 


Q c aQQ acracrcr 


11700 


t:.gaagaGatG 


11760 


G t gat:gcacg 


11820 


tcrcrtGGGGacr 


11880 


attcaccttc 


11940 


aatctGatgt 


12000 


ataatgGggg 


12060 


ggttgtttaa 


12120 


gGGat:gt:.aag 


12180 


CGtGGGcagg 


12240 


aaGGtctttt 


12300 


gactaataca 


12360 


cc tcrcrcracica 


12420 


aaa t g tgGa t 


12480 


oaaa g tcraqq 


12540 


gtgttGaatg 


12600 


GGGaGttGGG 


12660 




12720 


GaGctcggGG • 


12780 


GGtgGagGGt 


12840 


tgccctGtgG 


12900 


ggcgtctGGG 


12960 


acgcccccta 


13020 


agttcaGtga 


13080 


GGtgGtCtGt 


13140 


aaataGtggt 


13200 
taaatgtgtg
gagtctgttc 
cccgcggctg 
cctgcaggga 
tcacaccacg 
cctcttgccc 
ggacaggaac 
caacagaccc 
accctgttct 
aaggtggggt 
gggagaacat 
cccctggagc 
tcctctcttt 
tgagacctct 
acagagggct 
acctctgtca 
gagacccggg 
agacccaagg 
ggatgggggt 
aaaaagagga 
aaagaccaca 
agggcagagg 
tggaactcaa 
ccctcggccc 
gggaggacaa 
ctgtgagacg 
gacaggagtc 
caagagttct 
actgcatcag 
cgcctaccgc 
gaactcatcc 



ggagatctct 
catgtggaca 
ccccaggaat 
atagatcctc 
tcgaccctgg 
tccagggggt 
ctggctttgc 
ttctgatctc 
gccaggggtc 
gatcacagga 
ggacaggccc 
cttccccaag 
ccaggacaca 
gggcctgggt 
ggtccctccc 
ccttcctgtt 
accctgctgg 
acagaacaca 
ccatccacct 
gggacaaaag 
cccatgaccc 
gaggaaggac 
gctcttctcc 
ctccccacag 
cctgggagag 
gacagagggc 
acaccagaaa 
attttcctag 
atgacactitt: 
ggacattgga 
acaggaatct 



aaaaagaagc 
caggggcact 
gaggcctcaa 
cggcctgacc 
tcagcatccc 
gacattgcac 
taaggaagtg 
tcccacatac 
cctgtgcaac 
cagtcagcct 
tgagccgcag 
gacagcagag 
caagacacct 
ctccatccct 
cagccaccac 
gggcatcatc 
gtttctctgt 
gcagaggtca 
tgccgaaaag 
aggcagaaat 
acgtgatgct 
agcagaccag 
acagaggagg 
atggtgcatc 
ggtgggagga 
tcctgttgga 
aatcaaattg 
t:t:aat:tgt:ca 
aaataaaaac 
aaataagccc 
gcagcctgtc 



atatcacctc 
ggcaccagca 
CGCccagagc 
ctgcagccta 
tagggcagtt 
acagacatca 
gaggtggagc 
ctgctctgtt 
tccagactcc 
cgcagagaca 
ctcagccaac 
cccagagt.ca 
ccccctccac 
gggtcagtgg 
ccagtgagcc 
ccaccttccc 
cacaaaggaa 
gcactgggga 
atttgtctga 
gagaggggag 
gagaagtact 
acagtcacag 
acagagcaga 
ccctggcaga 
gggagctggg 
gcctgaatag 
aactggaatt 
ctggccacta 
at:aaccaggg 
caggctgtgg 
ccaggcactg 



cgtgtggccc 
tgggaggagg 
ttcagaaggg 
atccagagtt 
ccagacaagg 
ctcaggaaac 
ctggtttcca 
cctttctggg 
ctcctggtac 
gagaccaccc 
agacacggag 
cccacctccc 
atgcaggatc 
cggggttggt 
tttttctagc 
agagccctgg 
aataatcccc 
agacaggttg 
ggaactgaaa 
gggacagagg 
cctgccctag 
cagccttgac 
cagcagagac 
ggctcctgct 
gtctcctggg 
ggaagaggac 
ggaaaggggc 
cgtttttaaa 
catgaaacac 
agggccctgg 
gggtgcaacc 



ccagcagtca 
ccagcaagtg 
aggacagagg 
cagggtcagc 
ccggaggtct 
ggattcccct 
tcccttgctc 
tcctatgagg 
caccatgggg 
aggactgtca 

tccaccacag 
tggggactcc 
ggtactggag 
ccccagagcc 
agagcatggg 
ctggtgtgac 
tcctcccagg 
atagaaggga 
acacctgaat 
gaagagactc 
aaaacgttcc 
catggagtct 
cacaggtgaa 
taggacaggg 
atcagagagg 
aggaaaacct: 
aatcataata 
tgtcctcatc 
gaaccc^cat: 
aagatc 



13260
13320
13380 
13440 
13500 
13560 
13620 
13680 
13740 
13800 
13860 
13920
13980 
14040 
14100 
14160 
14220 
14280 
14340 
14400 
14460 
14520 
14580 
14640 
14700 
14760 
14820 
14880 
14940 
15000 
15056 



<210> 15 
<211> 858 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Mucin- TRE 



<400> 15 

cgagcggccc ctcagcttcg gcgcccagcc ccgcaaggct cccggtgacc actagagggc 60
gggaggagct cctggccagt ggtggagagt ggcaaggaag gaccctaggg ttcatcggag 120
cccaggttta ctcccttaag tggaaatttc ttcccccact cctccttggc tttctccaag 180
gagggaaccc aggctgctgg aaagtccggc tggggcgggg actgtgggtt caggggagaa 240
cggggtgtgg aacgggacag ggagcggtta gaagggtggg gctattccgg gaagtggtgg 300
ggggagggag cccaaaacta gcacctagtc cactcattat ccagccctct tatttctcgg 360
ccgctctgct tcagtggacc cggggagggc ggggaagtgg agtgggagac ctaggggtgg 420
gcttcccgac cttgctgtac aggacctcga cctagctggc tttgttcccc atccccacgt 480
tagttgttgc cctgaggcta aaactagagc ccaggggccc caagttccag actgccGctc 540
ccccctcccc cggagccagg gagtggttgg tgaaaggggg aggccagctg gagaacaaac 600
gggtagtcag ggggttgagc gattagagcc cttgtaccct acccaggaat ggttggggag 660
gaggaggaag aggtaggagg taggggaggg ggcggggttt tgtcacctgt cacctgctcg 720
ctgtgcctag ggcgggcggg cggggagtgg ggggaccggt ataaagcggt ^ggcgcctgt 780
gcccgctcca cctctcaagc agccagcgcc tgcctgaatc tgttctgccc cctccccacc 840
catttcacca ccaccatg 858
<210> 16
<211> 5224 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> AlphaFP-TRE 



<400> 16 

gaattcttag aaatatgggg gtaggggtgg tggtggtaat tctgttttca ccccataggt 60

gagataagca ttgggttaaa tgtgctttca cacacacatc acatttcata agaattaagg 120 

aacagactat gggctggagg actttgagga tgtctgtctc ataacacttg ggttgtatct 180

gttctatggg gcttgtttta agcttggcaa cttgcaacag ggttcactga ctttctcccc 240 

aagcccaagg tactgtcctc ttttcatatc tgttttgggg cctctggggc ttgaatatct 300 

gagaaaatat aaacatttca ataatgttct gtggtgagat gagtatgaga gatgtgtcat 360 

tcatttgtat caatgaatga atgaggacaa ttagtgtata aatccttagt acaacaatct 420 

gagggtaggg gtggtactat tcaatttcta tttataaaga tacttatttc tatttattta 480 

tgcttgtgac aaatgttttg ttcgggacca caggaatcac aaagatgagt ctttgaattt 540 

aagaagttaa tggtccagga ataattacat agcttacaaa tgactatgat ataccatcaa 600 

acaagaggtt ccatgagaaa ataatctgaa aggtttaata agttgtcaaa ggtgagaggg 660 

ctcttctcta gctagagact aatcagaaat acattcaggg ataattattt gaatagacct 720
taagggttgg gtacattttg ttcaagcatt gatggagaag gagagtgaat atttgaaaac 780

attttcaact aaccaaccac ccaatccaac aaacaaaaaa tgaaaagaat ctcagaaaca 840 

gtgagataag agaaggaatt ttctcacaac ccacacgtat agctcaactg ctctgaagaa 900 

gtatatatct aatatttaac actaacatca tgctaataat gataataatt actgtcattt 960 

tttaatgtct ataagtacca ggcatttaga agatattatt ccatttatat atcaaaataa 1020 

acttgagggg atagatcatt ttcatgatat atgagaaaaa ttaaaaacag attgaattat 1080 

ttgcctgtca tacagctaat aattgaccat aagacaatta gatttaaatt agttttgaat 1140

ctttctaata ccaaagttca gtttactgtt ccatgttgct tctgagtggc ttcacagact 1200 

tatgaaaaag taaacggaat cagaattaca tcaatgcaaa agcattgctg tgaactctgt 1260 

acttaggact aaactttgag caataacaca catagattga ggattgtttg ctgttagcat 1320 

acaaactctg gttcaaagct cctctttatt gcttgtcttg gaaaatttgc tgttcttcat 1380

ggtttctctt ttcactgcta tctatttttc tcaaccactc acatggctac aataactgtc 1440 

tgcaagctta tgattcccaa atatctatct ctagcctcaa tcttgttcca gaagataaaa 1500 

agtagtattc aaatgcacat caacgtctcc acttggaggg cttaaagacg tttcaacata 1560 

caaaccgggg agttttgcct ggaatgtttc ctaaaatgtg tcctgtagca catagggtcc 1620

tcttgttcct taaaatctaa ttacttttag cccagtgctc atcccaccta tggggagatg 1680 

agagtgaaaa gggagcctga ttaataatta cactaagtca ataggcatag agccaggact 1740 

gtttgggtaa actggtcact ttatcttaaa ctaaatatat ccaaaactiga acatgtactt 1800 

agttactaag tctttgactt tatctcattc ataccactca gctttatcca ggccacttat 1860 

ttgacagtat tattgcgaaa acttcctaac tggtctcctt atcatagtct tatccccttt 1920 

tgaaacaaaa gagacagttt caaaatacaa atatgatttt tattagctcc cttttgttgt 1980 

ctataatagt cccagaagga gttataaact ccatttaaaa agtctttgag atgtggccct 2040

tgccaacttt gccaggaatt cccaatatct agtattttct actattaaac tttgtgcctc 2100 

ttcaaaactg cattttctct cattccctaa gtgtgcattg ttttccctta ccggttggtt 2160 

tttccaccac cttttacatt ttcctggaac actataccct ccctcttcat ttggcccacc 2220 

tctaattttc tttcagatct ccatgaagat gttacttcct ccaggaagcc ttatctgacc 2280 

cctccaaaga tgtcatgagt tcctcttttc attctactaa tcacagcatc catcacacca 2340

tgttgtgatt actgatacta ttgtctgttt ctctgattag gcagtaagct caacaagagc 2400 

tacatggtgc ctgtctcttg ttgctgatta ttcccatcca aaaacagtgc ctggaatgca 2460

gacttaacat tttattgaat gaataaataa aaccccatct atcgagtgct actttgtgca 2520 

agacccggtt ctgaggcatt tatatttatt gatttattta attctcattt aaccatgaag 2580 

gaggtactat cactatcctt attttatagt tgataaagat aaagcccaga gaaatgaatt 2640 

aactcaccca aagtcatgta gctaagtgac agggcaaaaa ttcaaaccag ttccccaact 2700

ttacgtgatt aatactgtgc tatactgcct ctctgatcat atggcatgga atgcagacat 2760 

ctgctccgta aggcagaata tggaaggaga ttggaggatg acacaaaacc agcataatat 2820

cagaggaaaa gtccaaacag gacctgaact gatagaaaag ttgttactcc tggtgtagtc 2880 
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gcatcgacat 
ttatttgtag 
ttactaaagg 
agatgatcag 
tatatctata 
aaatatgttt 
gtaacccctt 
aaatgttgct 
catgcttggg 
aatacttcat 
tatttgagta 
tggaatccaa 
gcctgttttg 
aatggacaaa 
tatactatga 
taaatgtgtg 
tttaaaaatt 
agat:agaatt 
tccattattt 
gtacaatgat 
acaatatatt 
ggctgtatat 
tttgatttta 
ggtgcctggg 
ttgggaattc 
tagacaggga 
catcttgagt 
taaaattatt: 
at:acatcgca 
tgtttctgag 
attattctgc 
gttatatttg 
tgataggcat 
gctgtctttt 
tagggacaag 
aaaataaaat 
ataaaataca 
ggaaatacca 
gaaaagagta 
aaca 



cttgatgaac 
ttgtgtgtgt 
cctaccattt 
attggattac 
tctatatcta 
tgtaaagtta 
tctctacatt 
gagtcctggc 
caagcttata 
gactattgct 
aagtccccct 
acggatagac 
ttaacatgtt 
aactaacaaa 
taaatgtttg 
ccctagtagc 
ttatgtttaa 
aaattcactt 
ctgtctttta 
acacaaatga 
atgagcatat 
caccctactt 
taatcttacc 
tctcaactcc 
aaaaagttcc 
tgataggagg 
tgctaagtga 
ataggacttg 
tactaaatac 
ctaaacaatg 
aacttaggga 
caagatgaag 
ttaatagttt 
ctttatggct 
tcatgtcttt 
aagtttgcaa 
aataaccgct 
taaagtaaca 
taaaagaatt 



tggtggctga 
atttttatat 
gccaggcatt 
ttggccttga 
tatctatatc 
taaagatttc 
aaaccca1::gt 
tatgaaccag 
tagtttgctt 
tttcaggtat 
tgaggaagag 
aaggatggtg 
ccttaggaga 
tgaatgggaa 
tatttgctgg 
ttgcagtatg 
aattgcatag 
atcttaccat 
ttcaacattt 
gagcactctc 
ttccatttca 
tatgaatgta 
tttagaataa 
acagattctg 
cccagtgatt 
tgagccactc 
atgatgcacc 
gtttattagg 
tttaaaatgt 
acagcatiaat 
caagtcatct 
agtctgaatt 
taaagaatta 
tcattiaactt: 
gaatattctg 
gttttttttt 
atgctgttaa 
gatataccaa 
tcagcatgat 



cacaacatac 
atatatttg.t 
tttacatttg 
agatgatata 
tatatctata 
agactttata 
tggaacaaat 
acactgtgaa 
cataaaactc 
tccttcataa 
tagaagaact 
ctacctcttt 
caaaacttag 
ttgtacttga 
aagtgctact 
atctattttt 
tgctctttca 
ctagagaaac 
tttttagagg 
catgtattgt 
ttaaatattc 
ccat:att:aat 
tgaaacacct 
at:t:taact:gg 
ctaatgtgta 
ttagcatcca 
tgacccactt 
gcttgtgctc 
accttattga 
tat:caagcaa 
ctctgaattt 
ggtcagacaa 
atgtatttag 
aatttgagag 
tagtttgagg 
tctgccccaa 
ttattggcaa 
caaaaggtta 
tttccatatt 



attggcttga 
aatat.tgaaa 
tcccctctaa. 
tctacatcta 
tatgtatatc 
gaatctggga 
acatttatta 
agcctttggg 
tatttcagtt 
caaatacttt 
gcactttgta 
ctggagagta 
gagagacacg 
ttagcattga 
gacggtaaac 
taagtactgt 
ttgaagaagt 
ccaatgttaa 
gtgggaggaa 
tttgtcctgt 
ttccacaaag 
ttatttcctg 
gtgaagcttt 
tctgggttac 
gccaagatcg 
tcatttagta 
tataaagaca 
taagttttct 
catacatatt 
tgataatttg 
tttgtacttt 
tgtcttgtgt 
atgaattgca 
aaattaatta 
agaatatttg 
agagctctgt 
atgtcccatt 
ctagttaaca 
gtgcttccac 



tgtgtacata 

tagtcataat 

tcttttgatg 

tatctatatc 

agaaaagctg 

tttgccaaat 

ttcattcatc 

atattttgcc 

cttcataact 

ggctttcata 

aatactatcc 

cgtgagcaag 

catagcagaa 

agaccttgtt 

cctttttgtt 

acttagctta 

tttgagagag 

aactttgttg 

tacagaggag 

ttttcagtta 

ttattttgat 

gtgtgggtta 

agaaaatact 

agactaggca 

ggaacccttg 

ttaacatcat 

catgtgcaaa 

atgttaagcc 

aagtgaaaag 

aaatgaattt 

gagagtattt 

gcctggcata 

taccaaatct 

ttctgcaact 

ttatatttgc 

gtccttgaac 

ttcaacctaa 

ggcattgcct 

cactgccaat 



2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5224 



<210> 17 
<211> 307 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Nucleotide sequence for ADP 



<400> 17 

gatgaccggc tcaaccatcg cgcccacaac ggactatcgc aacaccactg ctaccggact 60 
aacatctgcc ctaaatttac cccaagttca tgcctttgtc aatgactggg cgagcttgga 120 
catgtggtgg ttttccatag cgcttatgtt tgtttgcctt attattatgt ggcttatttg 180 
ttgcctaaag cgcagacgcg ccagaccccc catctatagg cctatcattg tgctcaaccc 240 
acacaatgaa aaaattcata gattggacgg tctgaaacca tgttctcttc ttttacagta 300 
tgattaa 307 




<210> 18 
<211> 101 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Amino acid sequence for ADP 
<400> 18 



Met Thr Gly Ser Thr Ile Ala Pro Thr Thr Asp Tyr Arg Asn Thr Thr 
 1               5                  10                  15 
Ala Thr Gly Leu Thr Ser Ala Leu Asn Leu Pro Gln Val His Ala Phe 
            20                  25                  30 
Val Asn Asp Trp Ala Ser Leu Asp Met Trp Trp Phe Ser Ile Ala Leu 
        35                  40                  45 
Met Phe Val Cys Leu Ile Ile Met Trp Leu Ile Cys Cys Leu Lys Arg 
    50                  55                  60 
Arg Arg Ala Arg Pro Pro Ile Tyr Arg Pro Ile Ile Val Leu Asn Pro 
65                  70                  75                  80 
His Asn Glu Lys Ile His Arg Leu Asp Gly Leu Lys Pro Cys Ser Leu 
                85                  90                  95 
Leu Leu Gln Tyr Asp 
            100 
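
The correspondence between SEQ ID NO: 17 (ADP nucleotide sequence) and SEQ ID NO: 18 (ADP amino acid sequence) can be checked by translation. The following Python sketch is illustrative only and is not part of the filed listing. It assumes the standard genetic code and assumes the reading frame starts at the first atg of SEQ ID NO: 17; the names SEQ17, SEQ18 and translate are hypothetical.

```python
from itertools import product

# SEQ ID NO: 17 (307 nt), copied from the listing; whitespace is removed below.
SEQ17 = "".join("""
gatgaccggc tcaaccatcg cgcccacaac ggactatcgc aacaccactg ctaccggact
aacatctgcc ctaaatttac cccaagttca tgcctttgtc aatgactggg cgagcttgga
catgtggtgg ttttccatag cgcttatgtt tgtttgcctt attattatgt ggcttatttg
ttgcctaaag cgcagacgcg ccagaccccc catctatagg cctatcattg tgctcaaccc
acacaatgaa aaaattcata gattggacgg tctgaaacca tgttctcttc ttttacagta
tgattaa
""".split())

# SEQ ID NO: 18 (101 residues), three-letter codes as listed.
SEQ18 = """
Met Thr Gly Ser Thr Ile Ala Pro Thr Thr Asp Tyr Arg Asn Thr Thr
Ala Thr Gly Leu Thr Ser Ala Leu Asn Leu Pro Gln Val His Ala Phe
Val Asn Asp Trp Ala Ser Leu Asp Met Trp Trp Phe Ser Ile Ala Leu
Met Phe Val Cys Leu Ile Ile Met Trp Leu Ile Cys Cys Leu Lys Arg
Arg Arg Ala Arg Pro Pro Ile Tyr Arg Pro Ile Ile Val Leu Asn Pro
His Asn Glu Lys Ile His Arg Leu Asp Gly Leu Lys Pro Cys Ser Leu
Leu Leu Gln Tyr Asp
""".split()

# Standard genetic code, codons enumerated in t, c, a, g order.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

THREE = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}


def translate(dna):
    """Translate from the first atg up to (not including) the first stop codon."""
    start = dna.find("atg")  # assumption: the ORF begins at the first atg
    protein = []
    for i in range(start, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(THREE[aa])
    return protein


if __name__ == "__main__":
    print(translate(SEQ17) == SEQ18)
```

Under these assumptions the translation of SEQ ID NO: 17 reproduces the 101 residues of SEQ ID NO: 18.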

























<210> 19 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR EMCV IRES (PCR primer 96.74.2) 
<400> 19 

gacgtcgact aattccggtt attttcca 28 

<210> 20 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR EMCV IRES (PCR primer 96.74.1) 

<400> 20 

gacgtcgaca tcgtgttttt caaaggaa 28 
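
SEQ ID NOS: 19 and 20 are the PCR primers for the EMCV IRES of SEQ ID NO: 1. The Python sketch below is illustrative only and is not part of the filed listing. It checks that primer 96.74.2 matches the 5' end of SEQ ID NO: 1 and that the reverse complement of primer 96.74.1 matches its 3' end. Only the first and last rows of SEQ ID NO: 1 are copied here, and all variable and function names are hypothetical.

```python
# First and last rows of SEQ ID NO: 1 (EMCV IRES, 519 nt), whitespace removed.
IRES_FIRST_ROW = "".join(
    "gacgtcgact aattccggtt attttccacc atattgccgt cttttggcaa tgtgagggcc".split()
)
IRES_LAST_ROW = "".join(
    "gggacgtggt tttcctttga aaaacacgat gtcgacgtc".split()
)

# SEQ ID NO: 19 (primer 96.74.2) and SEQ ID NO: 20 (primer 96.74.1).
PRIMER_19 = "".join("gacgtcgact aattccggtt attttcca".split())
PRIMER_20 = "".join("gacgtcgaca tcgtgttttt caaaggaa".split())

COMPLEMENT = str.maketrans("acgt", "tgca")


def reverse_complement(seq):
    """Return the reverse complement of a lowercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


if __name__ == "__main__":
    # Forward primer should equal the first 28 nt of the IRES.
    print(PRIMER_19 == IRES_FIRST_ROW[:len(PRIMER_19)])
    # Reverse primer should be antisense to the last 28 nt of the IRES.
    print(reverse_complement(PRIMER_20) == IRES_LAST_ROW[-len(PRIMER_20):])
```

Both comparisons hold for the sequences as listed, which is consistent with the two primers flanking the full 519 nt of SEQ ID NO: 1.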

<210> 21 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ad5 sequence 1314 to 1338 (PCR primer 96.74.3) 
<400> 21 

cctgagacgc ccgacatcac ctgtg 25 




<210> 22 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Antisense of Ad5 sequence 1572 to 1586 (PCR primer 96.74.6) 
<400> 22 

gtcgaccatt cagcaaacaa aggcgttaac 30 

<210> 23 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ad5 sequence 1714 to 1728 (PCR primer 96.74.4) 
<400> 23 

tgctgaatgg tcgacatgga ggcttgggag 30 

<210> 24 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Antisense of Ad5 sequence 2070 to 2094 (PCR primer 96.74.5) 
<400> 24 

cacaaaccgc tctccacaga tgcatg 26 

<210> 25 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Human UPII (PCR primer 127.2.1) 

<400> 25 

aggaccggtc actatagggc acgcgtggt 29 

<210> 26 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Human UPII (PCR primer 127.2.2) 
<400> 26 

aggaccggtg ggatgctggg ctgggaggtg g 31 

<210> 27 
<211> 29 




<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 100.113.1 



<400> 27 

aggggtaccc actatagggc acgcgtggt 29 



<210> 28 
<211> 32 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> PCR primer 100.113.2 



<400> 28 

acccaagctt gggatgctgg gctgggaggt gg 32 



<210> 29 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 127.50.1 



<400> 29 

aggaccggtc aggcttcacc ccagacccac 30 



<210> 30 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 31.166.1 



<400> 30 

tgcgccggtg tacacaggaa gtga 24 



<210> 31 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 32.32.1 



<400> 31 

gagtttgtgc catcggtcta c 21 



<210> 32 
<211> 21 
<212> DNA 

<213> Artificial Sequence 




<220> 

<223> PCR primer 32.32.2 
<400> 32 

aatcaatcct tagtcctcct g 21 

<210> 33 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 51.176 
<400> 33 

gcagaaaaat cttccaaaca ctccc 25 

<210> 34 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 99.120.1 
<400> 34 

acgtacaccg gtcgttacat aacttac 27 

<210> 35 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 99.120.2 
<400> 35 

ctagcaaccg gtcggttcac taaacg 26 
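
Each record in this listing declares its length in the <211> field, so the declared value can be compared with the number of residues that follow <400>. The Python sketch below is illustrative only and is not part of the filed listing. It handles DNA records only, assumes lowercase nucleotide codes, and uses a hypothetical function name check_lengths; the sample text reproduces SEQ ID NOS: 33 and 35 from above.

```python
import re

SAMPLE = """
<210> 33
<211> 25
<212> DNA
<213> Artificial Sequence
<220>
<223> PCR primer 51.176
<400> 33
gcagaaaaat cttccaaaca ctccc

<210> 35
<211> 26
<212> DNA
<213> Artificial Sequence
<220>
<223> PCR primer 99.120.2
<400> 35
ctagcaaccg gtcggttcac taaacg
"""


def check_lengths(listing):
    """Return {seq_id: (declared_length, counted_length)} for DNA records."""
    results = {}
    # Split just before each <210> tag so every chunk holds one record.
    for chunk in re.split(r"(?=<210>)", listing):
        m_id = re.search(r"<210>\s*(\d+)", chunk)
        m_len = re.search(r"<211>\s*(\d+)", chunk)
        m_type = re.search(r"<212>\s*(\w+)", chunk)
        m_seq = re.search(r"<400>\s*\d+\s*(.*)", chunk, re.S)
        if not (m_id and m_len and m_type and m_seq):
            continue
        if m_type.group(1) != "DNA":
            continue
        # Drop whitespace and position numbers; keep lowercase residue codes.
        residues = re.sub(r"[^a-z]", "", m_seq.group(1))
        results[int(m_id.group(1))] = (int(m_len.group(1)), len(residues))
    return results


if __name__ == "__main__":
    for seq_id, (declared, counted) in sorted(check_lengths(SAMPLE).items()):
        status = "ok" if declared == counted else "MISMATCH"
        print(seq_id, declared, counted, status)
```

A mismatch between the two numbers would point to a residue lost or duplicated during transcription of the listing.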



